Your search
Results 19 resources
-
This dataset contains simultaneous recordings of electroglottography (EGG recorded with Glottal Enterprises EG2-PCX2), unfiltered audio, and intraoral pressure (recorded with Glottal Enterprises PG-60) from 14 subjects. It is meant to facilitate the validation of physical models of glottal control during voicing, in which the glottal/source waveform for speech is controlled by a combination of...
-
We introduce the Speak & Improve Corpus 2025, a dataset of L2 learner English data with holistic scores and language error annotation, collected from open (spontaneous) speaking tests on the Speak & Improve learning platform. The aim of the corpus release is to address a major challenge to developing L2 spoken language processing systems, the lack of publicly available data with high-quality...
-
DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.
-
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).
-
The MSP-AVW is an audiovisual whisper corpus for audiovisual speech recognition purpose. The MSP-AVW corpus contains data from 20 female and 20 male speakers. For each subject, three sessions are recorded consisting of read sentences, isolated digits and spontaneous speech. The data is recorded under neutral and whisper conditions. The corpus was collected in a 13ft x 13ft ASHA certified...
-
This 3-year project investigates language change in five urban dialects of Northern England—Derby, Newcastle, York, Leeds and Manchester. Data collection method: Linguistic analysis of speech data (conversational, word list) from samples of different northern English urban communities. Data collection consisted of interviews, which included (1) some structured questions about the interviewee...
-
Multi-laboratory evaluation of forensic voice comparison systems under conditions reflecting those of a real forensic case. There is increasing pressure on forensic laboratories to validate the performance of forensic analysis systems before they are used to assess strength of evidence for presentation in court (including pressure from the recently released report by the President’s Council...
-
Forensic database of voice recordings of 500+ Australian English speakers (AusEng 500+). This database contains 3899 recordings totalling 310 hours of speech from 555 Australian-English speakers. 324 female speakers: - 91 recorded in one recording session - 69 recorded in two separate recording sessions - 159 recorded in three recording sessions - 5 recorded in more than three recording...
-
The MSP-Conversation corpus contains interactions annotated with time-continuous emotional traces for arousal (calm to active), valence (negative to positive), and dominance (weak to strong). Time-continuous annotations offer the flexibility to explore emotional displays at different temporal resolutions while leveraging contextual information. Release 1.0 contains 74 conversations with...
-
Dynamic Dialects contains an articulatory video-based corpus of speech samples from world-wide accents of English. Videos in this corpus contain synchronised audio, ultrasound-tongue-imaging video and video of the moving lips. We are continuing to augment the database. The website contains three main resources: - A clickable Accent Map: clicking on points of the map will open up links to...
-
These transcripts and video files are samples of Spanish and English caregiver (almost always mother)-child interaction collected at child ages 2 ½, 3, and 3 ½ years as part of a 10-year longitudinal study of the language and literacy development of U.S.-born children raised in Spanish-speaking homes. Each recording is approximately 30 minutes in length. The caregiver and target child are...
-
The MSP-Podcast corpus contains speech segments from podcast recordings which are perceptually annotated using crowdsourcing. The collection of this corpus is an ongoing process. Version 1.11 of the corpus has 151,654 speaking turns (237 hours and 56 mins). The proposed partition attempts to create speaker-independent datasets for Train, Development, Test1, Test2, and Test3 sets.
-
This dataset contains 350 parallel utterances spoken by 10 native Mandarin speakers, and 10 English speakers with 5 emotional states (neutral, happy, angry, sad and surprise). The transcripts are provided.
-
The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English, both spoken and written, from the late twentieth century. Access the data here: https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2554
-
The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a creative commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simulatenously recorded real-world noise. The details: -...
Explore
Audio
-
Language
- English
- Bi-/Multilingual (1)
- L2+ (1)
- Language Learning (1)
- Mandarin (1)
- Multiple (1)
- Spanish (1)
-
Accent/Region
(9)
- American English (1)
- Australian English (2)
- British English (6)
- World Englishes (1)
- Child Speech (1)
- Conversation (7)
- Electroglottography / Electrolaryngography (1)
- Emotional Speech (2)
- Forensic (4)
- Multi-Speaker (13)
- Multi-Style (1)
- Singing (1)
- Speech in Noise (2)
Speech Production & Articulation
- Ultrasound (1)
- Video (1)
Teaching Resources
Vocal Anatomy
Tags
- audio data (15)
- English (14)
- adult (10)
- read speech (8)
- male (7)
- female (5)
- transcribed (5)
- interview (4)
- spontaneous speech (4)
- conversation (4)
- forensic (3)
- telephone (2)
- English accents (2)
- British (2)
- perceptually annotated (2)
- Australian (2)
- Newcastle (2)
- phonetic labels (2)
- Southern standard British English (SSBE) (1)
- map task (1)
- rainbow passage (1)
- labelled (1)
- non-speech (1)
- singing (1)
- environmental noise (1)
- noisy audio (1)
- reverberation (1)
- Mandarin (1)
- angry (1)
- emotional speech (1)
- happy (1)
- sad (1)
- surprise (1)
- podcast (1)
- Spanish (1)
- bilingual (1)
- child speech (1)
- child-centered audio (1)
- mother-child interaction (1)
- accent map (1)
- lip video (1)
- teaching resource (1)
- ultrasound tongue imaging (UTI) (1)
- arousal (1)
- dominance (1)
- valence (1)
- Derby (1)
- Leeds (1)
- Manchester (1)
- York (1)
- audiovisual (1)
- digits (1)
- video (1)
- whisper (1)
- American English (1)
- Ohio (1)
- British English (1)
- L2 English (1)
- L2 speech (1)
- annotated (1)
- language learning (1)
- electroglottography (EGG) (1)
- intraoral pressure (1)
- validation (1)
Resource type
- Dataset (12)
- Journal Article (1)
- Report (1)
- Software (1)
- Web Page (4)