Your search
Results 8 resources
-
For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic...
-
DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.
-
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).
-
Abstract The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus...
-
These transcripts and video files are samples of Spanish and English caregiver (almost always mother)-child interaction collected at child ages 2 ½, 3, and 3 ½ years as part of a 10-year longitudinal study of the language and literacy development of U.S.-born children raised in Spanish-speaking homes. Each recording is approximately 30 minutes in length. The caregiver and target child are...
-
This dataset contains 350 parallel utterances spoken by 10 native Mandarin speakers, and 10 English speakers with 5 emotional states (neutral, happy, angry, sad and surprise). The transcripts are provided.
-
The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a creative commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simulatenously recorded real-world noise. The details: -...
-
VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
Explore
Audio
- Language
-
Accent/Region
(2)
- American English (1)
- British English (1)
- Child Speech (1)
- Conversation (3)
- Emotional Speech (1)
- Multi-Speaker (4)
- Speech in Noise (1)
Speech Production & Articulation
- MRI (1)
Vocal Anatomy
- Vocal Tract (1)
Tags
- transcribed
- audio data (5)
- English (5)
- female (3)
- male (3)
- adult (3)
- phonetic labels (2)
- open-source (1)
- speech recognition (1)
- environmental noise (1)
- noisy audio (1)
- read speech (1)
- reverberation (1)
- Mandarin (1)
- angry (1)
- emotional speech (1)
- happy (1)
- sad (1)
- surprise (1)
- Spanish (1)
- bilingual (1)
- child speech (1)
- child-centered audio (1)
- mother-child interaction (1)
- French (1)
- MRI (1)
- rtMRI (1)
- volumetric MRI (1)
- American English (1)
- Ohio (1)
- conversation (1)
- British English (1)
- Newcastle (1)
- multi-language (1)
Resource type
- Dataset (4)
- Journal Article (1)
- Software (1)
- Web Page (2)