Your search
Results 36 resources
-
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).
-
Abstract The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus...
-
These transcripts and video files are samples of Spanish and English caregiver (almost always mother)-child interaction collected at child ages 2 ½, 3, and 3 ½ years as part of a 10-year longitudinal study of the language and literacy development of U.S.-born children raised in Spanish-speaking homes. Each recording is approximately 30 minutes in length. The caregiver and target child are...
-
This dataset contains 350 parallel utterances spoken by 10 native Mandarin speakers, and 10 English speakers with 5 emotional states (neutral, happy, angry, sad and surprise). The transcripts are provided.
-
The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a creative commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simulatenously recorded real-world noise. The details: -...
-
VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
Explore
Audio Data
- Accents (4)
- Child Speech (4)
- Conversation (9)
- Directed Speech (1)
- Emotional Speech (1)
-
Language
(7)
- African Languages (1)
- Bi-/Multilingual (1)
- English (3)
- French (1)
- Korean (1)
- Mandarin (1)
- Multiple (1)
- Spanish (1)
- Pathological (2)
- Speech in Noise (5)
Derived & Measured Data
Speech Production Data
- MRI (1)
-
Vocal Anatomy
(1)
- Vocal Tract (1)
Tags
- transcribed
- audio data (31)
- English (9)
- spontaneous speech (8)
- conversation (7)
- female (4)
- male (4)
- read speech (4)
- child speech (4)
- speech in noise (4)
- adult (4)
- French (3)
- British English (3)
- Mandarin (2)
- Spanish (2)
- speech-language pathology (2)
- multi-language (2)
- Sudanese (2)
- Nepali (2)
- Javanese (2)
- Bengali (2)
- older adult (2)
- American English (2)
- phonetic labels (2)
- open-source (1)
- speech recognition (1)
- environmental noise (1)
- noisy audio (1)
- reverberation (1)
- angry (1)
- emotional speech (1)
- happy (1)
- sad (1)
- surprise (1)
- bilingual (1)
- child-centered audio (1)
- mother-child interaction (1)
- Amyotrophic Lateral Sclerosis (ALS) (1)
- Down syndrome (1)
- Parkinson's disease (1)
- annotated (1)
- cerebral palsy (1)
- stroke (1)
- stutter (1)
- Lombard speech (1)
- clear speech (1)
- computer-directed speech (1)
- infant-directed speech (1)
- non-native-directed speech (1)
- formant measurement (1)
- phone duration (1)
- phone-level alignment (1)
- pitch (1)
- Chinese (1)
- Amharic (1)
- Swahili (1)
- Wolof (1)
- Korean (1)
- Sinhala (1)
- Khmer (1)
- Afrikaans (1)
- Sesotho (1)
- Setswana (1)
- isiXhosa (1)
- L2 English (1)
- Spanish accent (1)
- Czech (1)
- MRI (1)
- real-time MRI (rtMRI) (1)
- volumetric MRI (1)
- Ohio (1)
- Newcastle (1)
- interview (1)
- sociolinguistic (1)
- sociophonetic (1)
- African (1)
- Cameroon (1)
- Chad (1)
- Congo (1)
- Gabon (1)
- Niger (1)
Resource type
- Dataset (32)
- Journal Article (1)
- Software (1)
- Web Page (2)