Your search
Results 18 resources
-
A sound vocabulary and dataset AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By...
-
This CSTR VCTK Corpus includes speech data uttered by 110 English speakers with various accents. Each speaker reads out about 400 sentences, which were selected from a newspaper, the rainbow passage and an elicitation paragraph used for the speech accent archive.
-
Common Voice is a project to help make voice recognition open to everyone. Developers need an enormous amount of voice data to build voice recognition technologies, and currently most of that data is expensive and proprietary. We want to make voice data freely and publicly available, and make sure the data represents the diversity of real people. Together we can make voice recognition better for everyone.
Explore
Audio Data
-
Language
- African Languages (1)
- Bi-/Multilingual (1)
- English (11)
- French (1)
- German (1)
- Korean (1)
- L2+ (1)
- Language Learning (1)
- Mandarin (2)
- Multiple (2)
- Spanish (1)
-
Accent/Region
(2)
- British English (2)
- Child Speech (1)
- Conversation (2)
- Electroglottography / Electrolaryngography (1)
- Emotional Speech (2)
- Pathological (1)
- Singing (1)
- Speech in Noise (1)
Speech Production Data
- Articulography (1)
- EEG (1)
- MRI (1)
- Ultrasound (1)
- Video (1)
-
Vocal Anatomy
(3)
- Larynx and Glottis (1)
- Vocal Tract (1)
Tags
- audio data (16)
- English (9)
- read speech (8)
- transcribed (7)
- adult (6)
- female (4)
- male (4)
- Mandarin (2)
- perceptually annotated (2)
- spontaneous speech (2)
- open-source (1)
- English accents (1)
- rainbow passage (1)
- labelled (1)
- non-speech (1)
- singing (1)
- environmental noise (1)
- noisy audio (1)
- reverberation (1)
- British (1)
- angry (1)
- emotional speech (1)
- happy (1)
- sad (1)
- surprise (1)
- podcast (1)
- Spanish (1)
- bilingual (1)
- child speech (1)
- child-centered audio (1)
- mother-child interaction (1)
- conversation (1)
- dysarthria (1)
- pathological speech (1)
- speech-language pathology (1)
- ultrasound tongue imaging (UTI) (1)
- vowels (1)
- audiovisual (1)
- digits (1)
- video (1)
- whisper (1)
- L2 English (1)
- L2 speech (1)
- annotated (1)
- interview (1)
- language learning (1)
- electroglottography (EGG) (1)
- intraoral pressure (1)
- validation (1)
- multi-language (1)
- 3D head meshes (1)
- German (1)
- acoustic pharyngometry (1)
- electroencephalography (EEG) (1)
- electromagnetic articulography (EMA) (1)
- external craniofacial anthropometry (1)
- held vowel (1)
- rhinometry (1)
- syllable sequences (1)
- Amharic (1)
- Swahili (1)
- Wolof (1)
- Korean (1)
- French (1)
- MRI (1)
- real-time MRI (rtMRI) (1)
- volumetric MRI (1)
Resource type
- Dataset (9)
- Journal Article (3)
- Report (1)
- Software (1)
- Web Page (4)