Your search
Results 46 resources
-
English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a result, there are many varieties of English. Although the great many advances in English automatic speech recognition (ASR) over the past decades, results are usually reported based on test datasets which fail to represent the diversity of...
-
The Sociolinguistic Archive and Analysis Project, at North Carolina State University, is an interactive web-based archive of sociolinguistic recordings, with integrated media playing and annotation features, as well as phonetic analysis and corpus analysis tools designed for enabling and improving empirical linguistic inquiry. The archive continues to grow over time. It currently contains (as...
-
A multi-speaker corpus of ultrasound images of the tongue and video images of the lips The Tongue and Lips (TaL) corpus is a multi-speaker corpus of ultrasound images of the tongue and video images of lips. This corpus contains synchronised imaging data of extraoral (lips) and intraoral (tongue) articulators from 82 native speakers of English. The TaL corpus consists of two datasets: - TaL1...
-
This collection contains behavioural and brain activation data from 3 laboratory studies of speech imitation. Each of the three studies involved behavioural and imaging (MRI) test sessions in which participants were familiarised with novel auditory speech targets, and were asked to imitate them as closely as possible. Across the three studies, there were variations in the type of sounds...
-
This database includes clinically-verified 208 voice samples, from 150 pathological voices and 58 healthy voices. The database also includes information such as gender, age, pathology, lifestyle habits (e.g. smoking, alcohol and coffee consummation), occupational status, and the results of two specific medical questionnaires: the Voice Handicap Index (VHI) and Reflux Symptom Index...
-
For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic...
-
Coarticulation, one of the central issues in experimental phonetic research, refers to the articulatory overlap of neighbouring sounds, resulting in acoustic and perceptual modifications of these sounds. Studies of the development of coarticulatory patterns in children have produced conflicting results concerning adult-child differences. This research compares coarticulatory properties of...
-
SVQTD (Singing Voice Quality and Technique Database) is a classical tenor singing dataset collected from YouTube, it is mainly used to support supervised machine learning performing paralinguistic singing attribute recognition tasks. In SVQTD, there are nearly 4000 vocal solo segments with $4 - 20$ seconds long, totaling 10.7 hours. These segmenets are partitioned from 400 audios of 6 famous...
-
This dataset contains simultaneous recordings of electroglottography (EGG recorded with Glottal Enterprises EG2-PCX2), unfiltered audio, and intraoral pressure (recorded with Glottal Enterprises PG-60) from 14 subjects. It is meant to facilitate the validation of physical models of glottal control during voicing, in which the glottal/source waveform for speech is controlled by a combination of...
-
Currently available data set consists of the DICOM-datafiles and corresponding sound samples for all the finnish vowels. Some derivatives obtained from the image and sound data are also provided, this includes the surface models for the vowels.
-
We introduce the Speak & Improve Corpus 2025, a dataset of L2 learner English data with holistic scores and language error annotation, collected from open (spontaneous) speaking tests on the Speak & Improve learning platform. The aim of the corpus release is to address a major challenge to developing L2 spoken language processing systems, the lack of publicly available data with high-quality...
-
A dataset of ultrasound and audio recorded with children with speech sound disorders. The Ultrax 2020 dataset is a collection of ultrasound tongue imaging and audio data, gathered from children with speech sound disorders by speech and language therapists in hospital environments. We recorded data with 43 English-speaking children, but only 37 gave consent to share their data. These are 11...
-
A dataset of ultrasound and audio recorded with children with cleft lip and palate The cleft dataset is a collection of ultrasound tongue imaging and audio data, gathered from children with cleft lip and palate by a research speech and language therapist working in a hospital environment. We recorded data with 39 English-speaking children, but only 29 gave consent to share their data. These...
-
A dataset of ultrasound and audio recordings from children with speech sound disorders. The UltraPhonix dataset contains 20 speakers (16 male, 4 female), aged 6-13 years.
-
A dataset of ultrasound and audio recordings from children with speech sound disorders. The UXSSD dataset contains 8 speakers (2 female and 6 male), aged 5-10 years.
Explore
Audio
-
Accent/Region
(10)
- American English (2)
- Arabic (1)
- Australian English (2)
- British English (4)
- World Englishes (2)
- Child Speech (8)
- Conversation (6)
- Electroglottography / Electrolaryngography (1)
- Emotional Speech (3)
- Forensic (4)
-
Language
(23)
- Arabic (1)
- Bi-/Multilingual (1)
- English (15)
- French (1)
- L2+ (1)
- Language Learning (2)
- Mandarin (2)
- Multiple (1)
- Multiple (2)
- Spanish (1)
- Multi-Speaker (14)
- Multi-Style (1)
- Pathological (8)
- Singing (2)
- Speech in Noise (1)
- Synthetic Speech (1)
Derived & Measured Data
- Vocal Tract (1)
Speech Production & Articulation
- Articulography (2)
- Brain Imaging (1)
- MRI (9)
- Ultrasound (10)
- Video (3)
Teaching Resources
Vocal Anatomy
- Larynx and Glottis (1)
- Vocal Tract (8)
Tags
- audio data
- adult (23)
- male (21)
- read speech (19)
- female (18)
- English (17)
- child speech (9)
- spontaneous speech (9)
- transcribed (8)
- MRI (8)
- speech-language pathology (8)
- ultrasound (7)
- interview (6)
- video (5)
- articulatory data (5)
- real-time MRI (rtMRI) (5)
- conversation (5)
- volumetric MRI (4)
- forensic (3)
- telephone (3)
- British (3)
- perceptually annotated (3)
- American English (3)
- speech production (3)
- vowels (3)
- ultrasound tongue imaging (UTI) (3)
- annotated (3)
- speech sound disorder (3)
- open-source (2)
- English accents (2)
- singing (2)
- angry (2)
- audiovisual (2)
- emotional speech (2)
- happy (2)
- older adult (2)
- sad (2)
- articulation (2)
- multimodal (2)
- electromagnetic articulography (EMA) (2)
- vocal tract shape (2)
- lip video (2)
- teaching resource (2)
- sociophonetic (2)
- Australian (2)
- Mandarin (2)
- rtMRI (2)
- L2 English (2)
- pathological speech (2)
- Southern standard British English (SSBE) (1)
- map task (1)
- deepfake (1)
- logical access (1)
- physical access (1)
- spoof (1)
- synthetic speech (1)
- speech recognition (1)
- rainbow passage (1)
- labelled (1)
- non-speech (1)
- disgust (1)
- podcast (1)
- Spanish (1)
- bilingual (1)
- child-centered audio (1)
- mother-child interaction (1)
- consonants (1)
- jaw scans (1)
- accent map (1)
- International Phonetic Alphabet (IPA) (1)
- Arabic (1)
- accent variability (1)
- dialect variability (1)
- arousal (1)
- dominance (1)
- valence (1)
- Putonghua (1)
- French (1)
- Derby (1)
- Leeds (1)
- Manchester (1)
- Newcastle (1)
- York (1)
- digits (1)
- whisper (1)
- Ohio (1)
- phonetic labels (1)
- longitudinal (1)
- typically developing (1)
- L2 speech (1)
- language learning (1)
- DICOM (1)
- electroglottography (EGG) (1)
- intraoral pressure (1)
- validation (1)
- back placement (1)
- chest resonance (1)
- classical (1)
- front placement (1)
- head resonance (1)
- open throat (1)
- roughness (1)
- tenor (1)
- vibrato (1)
- dysarthria (1)
- Amyotrophic Lateral Sclerosis (ALS) (1)
- Down syndrome (1)
- Parkinson's disease (1)
- cerebral palsy (1)
- stroke (1)
- stutter (1)
- cleft (1)
- Scottish English (1)
- coarticulation (1)
- within-speaker variability (1)
- multi-language (1)
- held vowel (1)
- brain activity (1)
- fMRI (1)
- vocal imitation (1)
- professional voice (1)
- silent speech (1)
- sociolinguistic (1)
- World Englishes (1)
- dyadic (1)
Resource type
- Dataset (38)
- Journal Article (3)
- Report (1)
- Web Page (4)