Your search
Results 27 resources
-
Dynamic Dialects contains an articulatory video-based corpus of speech samples from world-wide accents of English. Videos in this corpus contain synchronised audio, ultrasound-tongue-imaging video and video of the moving lips. We are continuing to augment the database. The website contains three main resources: - A clickable Accent Map: clicking on points of the map will open up links to...
-
These transcripts and video files are samples of Spanish and English caregiver (almost always mother)-child interaction collected at child ages 2 ½, 3, and 3 ½ years as part of a 10-year longitudinal study of the language and literacy development of U.S.-born children raised in Spanish-speaking homes. Each recording is approximately 30 minutes in length. The caregiver and target child are...
-
The MSP-Podcast corpus contains speech segments from podcast recordings which are perceptually annotated using crowdsourcing. The collection of this corpus is an ongoing process. Version 1.11 of the corpus has 151,654 speaking turns (237 hours and 56 mins). The proposed partition attempts to create speaker-independent datasets for Train, Development, Test1, Test2, and Test3 sets.
-
This dataset contains 350 parallel utterances spoken by 10 native Mandarin speakers, and 10 English speakers with 5 emotional states (neutral, happy, angry, sad and surprise). The transcripts are provided.
-
The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English, both spoken and written, from the late twentieth century. Access the data here: https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2554
-
The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a creative commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simulatenously recorded real-world noise. The details: -...
-
A sound vocabulary and dataset AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By...
-
This CSTR VCTK Corpus includes speech data uttered by 110 English speakers with various accents. Each speaker reads out about 400 sentences, which were selected from a newspaper, the rainbow passage and an elicitation paragraph used for the speech accent archive.
-
VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
-
Common Voice is a project to help make voice recognition open to everyone. Developers need an enormous amount of voice data to build voice recognition technologies, and currently most of that data is expensive and proprietary. We want to make voice data freely and publicly available, and make sure the data represents the diversity of real people. Together we can make voice recognition better for everyone.
-
The West Yorkshire Regional English Database (WYRED) consists of approximately 200 hours of high-quality audio recordings of 180 West Yorkshire (British English) speakers. All participants are male between the ages of 18-30, and are divided evenly (60 per region) across three boroughs within West Yorkshire (Northern England): Bradford, Kirklees, and Wakefield. Speakers participated in four...
Explore
Audio
-
Language
- Arabic (1)
- Bi-/Multilingual (1)
- English (19)
- French (1)
- L2+ (1)
- Language Learning (2)
- Mandarin (3)
- Multiple (2)
- Multiple (2)
- Spanish (1)
-
Accent/Region
(10)
- American English (1)
- Arabic (1)
- Australian English (2)
- British English (6)
- World Englishes (1)
- Child Speech (1)
- Conversation (7)
- Electroglottography / Electrolaryngography (1)
- Emotional Speech (2)
- Forensic (5)
- Multi-Speaker (15)
- Multi-Style (1)
- Pathological (1)
- Singing (1)
- Speech in Noise (2)
Speech Production & Articulation
- Brain Imaging (1)
- MRI (2)
- Ultrasound (2)
- Video (1)
Teaching Resources
Vocal Anatomy
- Larynx and Glottis (1)
- Vocal Tract (1)
Tags
- audio data (23)
- adult (15)
- English (14)
- read speech (11)
- male (10)
- female (9)
- transcribed (8)
- interview (5)
- spontaneous speech (5)
- conversation (4)
- forensic (3)
- telephone (3)
- Mandarin (3)
- open-source (2)
- English accents (2)
- British (2)
- perceptually annotated (2)
- ultrasound tongue imaging (UTI) (2)
- Australian (2)
- MRI (2)
- rtMRI (2)
- Newcastle (2)
- phonetic labels (2)
- Southern standard British English (SSBE) (1)
- map task (1)
- speech recognition (1)
- rainbow passage (1)
- labelled (1)
- non-speech (1)
- singing (1)
- environmental noise (1)
- noisy audio (1)
- reverberation (1)
- angry (1)
- emotional speech (1)
- happy (1)
- sad (1)
- surprise (1)
- podcast (1)
- Spanish (1)
- bilingual (1)
- child speech (1)
- child-centered audio (1)
- mother-child interaction (1)
- accent map (1)
- lip video (1)
- teaching resource (1)
- Arabic (1)
- accent variability (1)
- dialect variability (1)
- older adult (1)
- sociophonetic (1)
- arousal (1)
- dominance (1)
- valence (1)
- Putonghua (1)
- French (1)
- volumetric MRI (1)
- Derby (1)
- Leeds (1)
- Manchester (1)
- York (1)
- audiovisual (1)
- digits (1)
- video (1)
- whisper (1)
- American English (1)
- Ohio (1)
- British English (1)
- L2 English (1)
- L2 speech (1)
- annotated (1)
- language learning (1)
- electroglottography (EGG) (1)
- intraoral pressure (1)
- validation (1)
- dysarthria (1)
- pathological speech (1)
- speech-language pathology (1)
- vowels (1)
- multi-language (1)
- brain activity (1)
- fMRI (1)
- vocal imitation (1)
Resource type
- Dataset (17)
- Journal Article (3)
- Report (1)
- Software (1)
- Web Page (5)