Search

Full catalogue 115 resources

Page 1 of 8

Abstracts

rtMRIDB Speech Organ Contour Data Ver. 0.9

Kikuo Maekawa, Hironori Takemoto

We are releasing the rtMRIDB Speech Organ Contour Data Ver. 0.9 (abb. rtMRI_cont). This dataset provides numerical data extracted from each frame of the real-time MRI videos published in the Realtime MRI Articulatory Movement Database, Ver. 2 (rtMRIDB_v2) [1], containing contour information of speech organs. Since this dataset may be updated in the near future, it is being released as a...

View on rtmridb.ninjal.ac.jp
rtMRIDB (The real-time MRI articulatory movement database)

Kikuo Maekawa

This is a database of moving images of the midsagittal section of the vocal tract during the production of Japanese utterances, recorded at a rate of 14 or 27 frames per second by using a medical MRI system with special operating settings. This data has realized the dream of articulatory phoneticians to visualize articulatory movements and may be widely used for critical review of the existing...

View on rtmridb.ninjal.ac.jp
The Edinburgh International Accents of English Corpus

Ramon Sanabria, Nina Markl, Andrea Carmantini + 4 others

English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a result, there are many varieties of English. Although the great many advances in English automatic speech recognition (ASR) over the past decades, results are usually reported based on test datasets which fail to represent the diversity of...

View on datashare.ed.ac.uk
The Sociolinguistic Archive and Analysis Project (SLAAP)

Tyler Kendall

The Sociolinguistic Archive and Analysis Project, at North Carolina State University, is an interactive web-based archive of sociolinguistic recordings, with integrated media playing and annotation features, as well as phonetic analysis and corpus analysis tools designed for enabling and improving empirical linguistic inquiry. The archive continues to grow over time. It currently contains (as...

View on slaap.chass.ncsu.edu
Synthetic vowels generated with 1D and 3D acoustic models

Rémi Blandin Blandin, Simon Stone Stone, Angélique Remacle Remacle + 2 others

This dataset contains the synthetic stimuli used in the study published in the paper "A Comparative Study of 3D and 1D Acoustic Simulations of the Higher Frequencies of Speech". The goal of this study was to evaluate the accuracy of the acoustic wave propagation in the vocal tract in a source-filter synthesis paradigm with two perceptual experiments. The high frequencies (above 4 kHz) of the...

View on ieee-dataport.org
The Tongue and Lips Corpus

M. S. Ribeiro, J. Sanger, J.-X. Zhang + 4 others

A multi-speaker corpus of ultrasound images of the tongue and video images of the lips The Tongue and Lips (TaL) corpus is a multi-speaker corpus of ultrasound images of the tongue and video images of lips. This corpus contains synchronised imaging data of extraoral (lips) and intraoral (tongue) articulators from 82 native speakers of English. The TaL corpus consists of two datasets: - TaL1...

View on ultrasuite.github.io
Vocal Learning in Adulthood: Investigating the mechanisms of vocal imitation using MRI of the vocal tract and brain 2015-2018

Carolyn McGettigan, Marc Miquel, Daniel Carey + 2 others

This collection contains behavioural and brain activation data from 3 laboratory studies of speech imitation. Each of the three studies involved behavioural and imaging (MRI) test sessions in which participants were familiarised with novel auditory speech targets, and were asked to imitate them as closely as possible. Across the three studies, there were variations in the type of sounds...

View on reshare.ukdataservice.ac.uk
VOICED Database

Laura Verde, Giovanna Sannino

This database includes clinically-verified 208 voice samples, from 150 pathological voices and 58 healthy voices. The database also includes information such as gender, age, pathology, lifestyle habits (e.g. smoking, alcohol and coffee consummation), occupational status, and the results of two specific medical questionnaires: the Voice Handicap Index (VHI) and Reflux Symptom Index...

View on physionet.org
Data from: Voice efficiency for different voice qualities combining experimentally derived sound signals and numerical modeling of the vocal tract

M. Fleischer, S. Rummel, F. Stritt + 5 others

This dataset contains Stereo-Lithographic (STL) surface models of a human vocal tract, derived Finite-Element-Models, numerical results, and scripts for analyzing these results and (re-)running the computation. In the main folder, this dataset contains: 1) Python files (*fig*.py) for the creation of figures and tables (*tab*.py) 2) Python files (*.py) for analyzing Finite-Element (FE)...

View on zenodo.org
Perceptual Voice Qualities Database (PVQD)

Patrick R Walden

This database was created through generous funding from The Voice Foundation's Advancing Scientific Voice Research Grant and contains voice samples which have been rated by experienced voice professionals (at least 3 different raters with a minimum of 3 years’ clinical experience) in order to provide educators with standardized materials to better train pre-service clinical voice...

View on data.mendeley.com
The UCLA Phonetics Lab Archive

Peter Ladefoged

For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic...

View on archive.phonetics.ucla.edu
VoxAngeles

E Chodroff, B. Pažon, A. Baker + 1 others

VoxAngeles is a corpus of audited phonetic transcriptions and phone-level alignments of the UCLA Phonetics Lab Archive (Ladefoged et al., 2009, http://archive.phonetics.ucla.edu/), along with phonetic measurements including word and phone durations, vowel f0 and vowel formants. The audited portion of the corpus currently contains data from 95 languages across 21 language families. Unaudited...

View on github.com
An ultrasound study of lingual coarticulation in children and adults

Natalia Zharkova

Coarticulation, one of the central issues in experimental phonetic research, refers to the articulatory overlap of neighbouring sounds, resulting in acoustic and perceptual modifications of these sounds. Studies of the development of coarticulatory patterns in children have produced conflicting results concerning adult-child differences. This research compares coarticulatory properties of...

View on reshare.ukdataservice.ac.uk
Acted clear speech corpus

Catherine Mayo, Catherine Mayo

Single male native British English talker recorded producing 25 TIMIT sentences in 5 conditions, two natural: (i) quiet, (ii) while the talker listened to high-intensity speech-shaped noise, and three acted: (i) as if to a non-native listener, (ii) as if to a computer speech-recognition system, (iii) as if to an infant. Accompanied by automatic and hand-corrected phone-level transcription.

View on datashare.ed.ac.uk
The Correspondence of Vocal Tract Resonance With Volumes Obtained From Magnetic Resonance Images

Christopher A. Moore

The increasing availability of magnetic resonance imaging (MRI) as a research, and even clinical, tool in speech production makes possible a wide range of quantitative methods in vocal tract measurement. In these initial stages of application, it is essential that the limits of the method be identified. The present investigation was designed to apply the techniques of digital image analysis...

View on pubs.asha.org

Page 1 of 8

Main feed

Last update from database: 18/02/2026, 04:10 (UTC)

Search

Full catalogue 115 resources

Explore

Audio Data

Derived & Measured Data

Software, Processing & Utilities

Speech Production Data

Teaching Resources

Tags

Resource type