Your search
Results 7 resources
-
The frequencies, magnitudes, and bandwidths of vocal tract resonances are all important in understanding and synthesizing speech. High precision acoustic impedance spectra of the vocal tracts of 10 subjects were measured from 10 Hz to 4.2 kHz by injecting a broadband acoustic signal through the lips. Between 300 Hz and 4 kHz the acoustic resonances R (impedance minima measured through the...
-
Abstract The use of real-time magnetic resonance imaging (rt-MRI) of speech is increasing in clinical practice and speech science research. Analysis of such images often requires segmentation of articulators and the vocal tract, and the community is turning to deep-learning-based methods to perform this segmentation. While there are publicly available rt-MRI datasets of speech,...
-
Abstract The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus...
-
The USC Speech and Vocal Tract Morphology MRI Database consists of real-time magnetic resonance images of dynamic vocal tract shaping during read and spontaneous speech with concurrently recorded denoised audio, and 3D volumetric MRI of vocal tract shapes during vowels and continuant consonants sustained for 7 seconds, from 17 speakers.
-
USC-EMO-MRI is an emotional speech production database which includes real-time magnetic resonance imaging data with synchronized speech audio from five male and five female actors, each producing a passage and a set of sentences in multiple repetitions, while enacting four different target emotions (neutral, happy, angry, sad). The database includes emotion quality evaluation from at least...
-
USC-TIMIT is a database of speech production data under ongoing development, which currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English, and electromagnetic articulography data from four of these speakers. The two modalities were recorded in two independent sessions while the subjects produced the same 460 sentence corpus. In...
-
Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving...
Explore
Audio Data
- Emotional Speech (1)
-
Language
(1)
- French (1)
Derived & Measured Data
- Formant Measurements (1)
- Vocal Tract (1)
Software, Processing & Utilities
Speech Production Data
-
Vocal Anatomy
- Mechanical Properties (1)
- Vocal Tract (6)
- Articulography (1)
- MRI (6)
Tags
- female
- adult (7)
- male (7)
- real-time MRI (rtMRI) (6)
- MRI (5)
- audio data (5)
- English (4)
- articulatory data (4)
- read speech (4)
- volumetric MRI (3)
- speech production (3)
- multimodal (2)
- articulation (1)
- American English (1)
- electromagnetic articulography (EMA) (1)
- angry (1)
- emotional speech (1)
- happy (1)
- perceptually annotated (1)
- sad (1)
- consonants (1)
- vocal tract shape (1)
- vowels (1)
- French (1)
- transcribed (1)
- segmentation (1)
- antiresonance (1)
- vocal tract length (1)
- vocal tract resonance (1)
Resource type
- Dataset (4)
- Journal Article (3)