Search results: 30 resources
-
Participants: English participants were 49 young adults (30 females, mean age=21.3, SD=3.6) with no history of psychiatric, neurological or other medical illness that might compromise cognitive functions. They self-identified as native English speakers, and strictly qualified as right-handed on the Edinburgh handedness inventory. All participants were paid, and gave written informed consent...
-
This repository introduces 🌀 ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts. 🔥 Key Features: 3000+ hours of synthetic speech. Diverse Distribution Shifts: the dataset spans 7 key distribution shifts, including 📖 Reading Style, 🎙️ Podcast, 🎥 YouTube, 🗣️ Languages (three different languages), 🌎 Demographics (including variations in age, accent, and gender). Multiple...
-
The In-the-Wild dataset contains real and synthetic speech recordings of 58 celebrities and politicians, collected from online videos. It provides a realistic benchmark for testing how well audio deepfake detection models generalize beyond laboratory data such as ASVspoof. Task: Audio Classification (Deepfake / Genuine). Languages: English. Modality: Audio. Size: 37.9 hours total (17.2 hours fake, 20.7 hours real).
-
This speech corpus contains recordings for 104 monolingual native southern British English speakers aged between 8 and 85 years old while they engaged in a problem-solving picture-based ‘spot the difference’ task (Diapix) with a conversational partner in four listening conditions. In NORM (quiet, no masking), participants heard each other normally. In SPSN (speech-shaped noise), participants...
-
This collection contains the quantitative data resulting from the analysis of the elderLUCID audio corpus – a set of speech recordings collected for 83 adults aged 19 to 84 years inclusive. Recordings were made while participants carried out two types of collaborative tasks with a conversational partner who was a young adult of the same sex: (1) a ‘spot the difference’ picture task (‘diapix’)...
-
Fully annotated corpus of spontaneous speech dialogues for children. Diapix task recorded as stereo WAV files with one speaker per channel. 96 children aged between 9 and 14 years old. Non-bilingual native Southern British English speakers.
-
The Nijmegen Corpus of Spanish English (NCSE) contains 38.5 hours of high-quality recordings of English speech produced by 34 native Spanish speakers in interaction with two native Dutch confederates. The NCSE contains a formal and an informal recording for each Spanish speaker. The speech has been orthographically transcribed.
-
The Tongue and Lips (TaL) corpus is a multi-speaker corpus of ultrasound images of the tongue and video images of the lips. This corpus contains synchronised imaging data of extraoral (lips) and intraoral (tongue) articulators from 82 native speakers of English. The TaL corpus consists of two datasets: - TaL1...
-
We introduce the Speak & Improve Corpus 2025, a dataset of L2 learner English data with holistic scores and language error annotation, collected from open (spontaneous) speaking tests on the Speak & Improve learning platform. The aim of the corpus release is to address a major challenge to developing L2 spoken language processing systems, the lack of publicly available data with high-quality...
-
The data deposited are taken from fieldwork recordings undertaken with speakers from the three fieldwork sites, Newcastle, Sunderland and Middlesbrough in the Northeast of England. From each locality, 40 informants were recorded, giving a total of 120 informants. The key data in the file All_formants_July_2018.xlsx are vowel formant frequency measurements, in Hertz, for the peripheral...
-
DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.
-
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).
-
The MSP-AVW is an audiovisual whisper corpus for audiovisual speech recognition purposes. It contains data from 20 female and 20 male speakers. For each subject, three sessions are recorded, consisting of read sentences, isolated digits and spontaneous speech. The data is recorded under neutral and whisper conditions. The corpus was collected in a 13ft x 13ft ASHA certified...
-
This 3-year project investigates language change in five urban dialects of Northern England—Derby, Newcastle, York, Leeds and Manchester. Data collection method: Linguistic analysis of speech data (conversational, word list) from samples of different northern English urban communities. Data collection consisted of interviews, which included (1) some structured questions about the interviewee...