Search

Full catalogue 113 resources

Page 4 of 8

Abstracts

Corpus Phonetics Tutorial

Eleanor Chodroff

Corpus phonetics has become an increasingly popular method of research in linguistic analysis. With advances in speech technology and computational power, large scale processing of speech data has become a viable technique. A fair number of researchers have exploited these methods, yet these techniques still remain elusive for many. In the words of Mark Liberman, there has been “surprisingly...

View on www.eleanorchodroff.com
R for Linguists

Eleanor Chodroff

This website provides a tutorial on R for Linguists. The tutorial provides learners with a foundation to the R programming language using RStudio. It covers the basics of the R grammar, including how to define variables, use R as a calculator, and do basic data manipulation. Building on basic data manipulation, it then covers how to work with full datasets and obtaining summary statistics...

View on www.eleanorchodroff.com
The Use and Utility of Localised Speech Forms in Determining Identity Corpus (TUULS) - Vowel Formant Frequency Data

Carmen Llamas, Peter French, Dominic Watt + 5 others

The data deposited are taken from fieldwork recordings undertaken with speakers from the three fieldwork sites, Newcastle, Sunderland and Middlesbrough in the Northeast of England. From each locality, 40 informants were recorded, giving a total of 120 informants. The key data in the file All_formants_July_2018.xlsx are vowel formant frequency measurements, in Hertz, for the peripheral...

View on reshare.ukdataservice.ac.uk
Diapix Adaptation Project

View on speechbox.linguistics.northwestern.edu
Diachronic Electronic Corpus of Tyneside English (DECTE)

Karen P. Corrigan, Isabelle Buchstaller, Adam Mearns + 1 others

DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.

View on research.ncl.ac.uk
University College London’s Archive of Stuttered Speech (UCLASS)

This site allows visitors to access recordings of speakers who stutter and background details about these speakers and the conditions in which the recordings were made. The recordings are available in various formats. The main two sets of recordings were made in normal speaking conditions and the final one was made when the sound of the speaker’s voice was altered as he or she spoke. The three...

View on www.uclass.psychol.ucl.ac.uk
Speech Accessibility Project

The current data package includes 1,090 hours of recorded speech (as .wav files) from about 1,130 participants, including those with ALS, cerebral palsy, Down syndrome, Parkinson’s disease and those who have had a stroke. The download also includes text of the original speech prompts and a transcript of the participants’ responses. A subset includes annotations describing the speech...

View on speechaccessibilityproject.beckman.illinois.edu
Community-Supported Shared Infrastructure in Support of Speech Accessibility

Mark Hasegawa-Johnson, Xiuwen Zheng, Heejin Kim + 22 others

Purpose: The Speech Accessibility Project (SAP) intends to facilitate research and development in automatic speech recognition (ASR) and other machine learning tasks for people with speech disabilities. The purpose of this article is to introduce this project as a resource for researchers, including baseline analysis of the first released data package. ...

View on pubs.asha.org
Buckeye Corpus

The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).

View on buckeyecorpus.osu.edu
Audiovisual Whisper (AVW) Corpus

The MSP-AVW is an audiovisual whisper corpus for audiovisual speech recognition purpose. The MSP-AVW corpus contains data from 20 female and 20 male speakers. For each subject, three sessions are recorded consisting of read sentences, isolated digits and spontaneous speech. The data is recorded under neutral and whisper conditions. The corpus was collected in a 13ft x 13ft ASHA certified...

View on ecs.utdallas.edu
A comparative study of language change in Northern Englishes

William Haddican, Paul Foulkes

This 3-year project investigates language change in five urban dialects of Northern England—Derby, Newcastle, York, Leeds and Manchester. Data collection method: Linguistic analysis of speech data (conversational, word list) from samples of different northern English urban communities. Data collection consisted of interviews, which included (1) some structured questions about the interviewee...

View on reshare.ukdataservice.ac.uk
An Audio-Ultrasound Synchronized Database of Tongue Movement for Mandarin speech

Yudong Yang, Rongfeng Su, Shaofeng Zhao + 4 others

Ultrasound imaging has been widely adopted in speech research to visualize dynamic tongue movements during speech production. These images are universally used as visual feedback in interventions for articulation disorders or visual cues in speech recognition. Nevertheless, the availability of high-quality audio-ultrasound datasets remains scarce. The present study, therefore, aims to...

View on www.nature.com
Real-time speech MRI datasets with corresponding articulator ground-truth segmentations

Matthieu Ruthven, Agnieszka M. Peplinski, David M. Adams + 2 others

Abstract The use of real-time magnetic resonance imaging (rt-MRI) of speech is increasing in clinical practice and speech science research. Analysis of such images often requires segmentation of articulators and the vocal tract, and the community is turning to deep-learning-based methods to perform this segmentation. While there are publicly available rt-MRI datasets of speech,...

View on www.nature.com
Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers

Karyna Isaieva, Yves Laprie, Justine Leclère + 3 others

Abstract The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus...

View on www.nature.com
An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images

Michel Belyk

Real-time magnetic resonance imaging (rtMRI) is a technique that provides high-contrast videographic data of human anatomy in motion. Applied to the vocal tract, it is a powerful method for capturing the dynamics of speech and other vocal behaviours by imaging structures internal to the mouth and throat. These images provide a means of studying the physiological basis for speech, singing,...

View on osf.io

Page 4 of 8

Main feed

Last update from database: 16/07/2025, 04:10 (UTC)

Search

Full catalogue 113 resources

Explore

Audio

Benchmarks & Validation

Derived & Measured Data

Software, Processing & Utilities

Speech Production & Articulation

Teaching Resources

Vocal Anatomy

Tags

Resource type