Results | YorVoice Catalogue

Explore your larynx like never before with this physics-based interactive 3D model. Experiment with various laryngeal configurations as you move, rotate, pull and shake. Developed by award-winning linguistics professor and phonetician Dr. Scott Moisik, this model is anatomically accurate and includes structures that are not represented in other models. It is an excellent introduction to...

View on sites.google.com

Speech Acoustics YouTube Channel

Brad Story

View on www.youtube.com

Corpus Phonetics Tutorial

Eleanor Chodroff

Corpus phonetics has become an increasingly popular method of research in linguistic analysis. With advances in speech technology and computational power, large-scale processing of speech data has become a viable technique. A fair number of researchers have exploited these methods, yet the techniques remain elusive for many. In the words of Mark Liberman, there has been “surprisingly...

View on www.eleanorchodroff.com

R for Linguists

Eleanor Chodroff

This website provides a tutorial on R for Linguists. The tutorial gives learners a foundation in the R programming language using RStudio. It covers the basics of the R grammar, including how to define variables, use R as a calculator, and do basic data manipulation. Building on basic data manipulation, it then covers how to work with full datasets and obtain summary statistics...

View on www.eleanorchodroff.com

Speech Zone

Simon King

Hi – my name is Simon King and this is my personal website for supporting my teaching. I am the Professor of Speech Processing at the University of Edinburgh, where I teach courses in speech processing and speech synthesis at advanced undergraduate and Masters level. Use of this website. Students: You may use this website freely for personal use. You may download copies of the content for your...

View on speech.zone

real-time MRI IPA charts

Asterios Toutios, Sajan Goud Lingala, Colin Vaz + 8 others

We have been collecting real-time MRI data from phoneticians producing the sounds of the International Phonetic Alphabet, together with standard sentences and texts. You may access the collected data by clicking on the pictures below.

View on sail.usc.edu

Speech Rate: Praat script that detects syllable nuclei

Nivja de Jong, Ton Wempe

Praat script that automatically detects syllable nuclei in order to measure speech rate without the need for a transcription. Peaks in intensity (dB) that are preceded and followed by dips in intensity are considered potential syllable nuclei. The script subsequently discards peaks that are not voiced.

View on sites.google.com
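
The detection idea described for this script can be sketched in a few lines of Python. This is a simplified illustration, not a port of the Praat script: the function names, the frame-based inputs, and the 2 dB default dip are assumptions made for the example, and the voicing flags are taken as given rather than computed from a pitch analysis.

```python
from typing import List, Sequence


def find_syllable_nuclei(
    intensity_db: Sequence[float],
    voiced: Sequence[bool],
    min_dip_db: float = 2.0,  # illustrative default, not taken from the script
) -> List[int]:
    """Return frame indices of intensity peaks that look like syllable nuclei.

    A frame is kept when it is a local intensity maximum, the contour dips
    by at least min_dip_db on both sides before rising above that peak
    again, and the frame is voiced.
    """
    if len(intensity_db) != len(voiced):
        raise ValueError("intensity_db and voiced must have the same length")

    nuclei: List[int] = []
    n = len(intensity_db)
    for i in range(1, n - 1):
        peak = intensity_db[i]
        # Local maximum; the >= on the right keeps one frame per plateau.
        if not (peak > intensity_db[i - 1] and peak >= intensity_db[i + 1]):
            continue

        # Lowest intensity to the left before the contour exceeds this peak.
        left_min = peak
        j = i - 1
        while j >= 0 and intensity_db[j] <= peak:
            left_min = min(left_min, intensity_db[j])
            j -= 1

        # Lowest intensity to the right before the contour exceeds this peak.
        right_min = peak
        j = i + 1
        while j < n and intensity_db[j] <= peak:
            right_min = min(right_min, intensity_db[j])
            j += 1

        dips_on_both_sides = (
            peak - left_min >= min_dip_db and peak - right_min >= min_dip_db
        )
        # Discard peaks that are not voiced.
        if dips_on_both_sides and voiced[i]:
            nuclei.append(i)
    return nuclei


def speech_rate(n_nuclei: int, duration_s: float) -> float:
    """Syllable nuclei per second over the given duration."""
    return n_nuclei / duration_s


# Toy intensity contour (dB per frame) with hypothetical voicing flags.
intensity = [50, 60, 52, 63, 51, 58, 50, 61, 49]
voiced = [False, True, True, True, True, False, False, True, False]
nuclei = find_syllable_nuclei(intensity, voiced)
```

In the toy contour, the peak at frame 5 has dips on both sides but is unvoiced, so it is discarded. Dividing the number of remaining nuclei by the recording duration gives a transcription-free estimate of speech rate.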

SpeechBox: digital speech corpora

Anne R Bradlow

SpeechBox is a set of multiple speech resources. Each will be added to the YorVoice Data Catalogue as individual, searchable resources soon.

View on speechbox.linguistics.northwestern.edu

Multimodal Signal Processing (MSP) Podcast corpus

The MSP-Podcast corpus contains speech segments from podcast recordings which are perceptually annotated using crowdsourcing. The collection of this corpus is an ongoing process. Version 1.11 of the corpus has 151,654 speaking turns (237 hours and 56 mins). The proposed partition attempts to create speaker-independent datasets for Train, Development, Test1, Test2, and Test3 sets.

View on ecs.utdallas.edu

British National Corpus

The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English, both spoken and written, from the late twentieth century. Access the data here: https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2554

View on www.natcorp.ox.ac.uk

Voices Obscured in Complex Environmental Settings (VOiCES)

Colleen Richey, Maria A. Barrios, Zeb Armstrong + 11 others

The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a Creative Commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simultaneously recorded real-world noise. The details: -...

View on iqtlabs.github.io

AudioSet

A sound vocabulary and dataset. AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By...

View on research.google.com

openslr.org

Open SLR is a set of multiple speech resources. Each will be added to the YorVoice Data Catalogue as individual, searchable resources soon.

View on www.openslr.org

VoxForge

VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).

View on www.voxforge.org

3D printable vocal tract tube models

Takayuki Arai

The Vowel by Chiba and Kajiyama (1941-42) integrated the mechanisms of vowel production and perception from the viewpoints of physiology, physics and psychology in a single book. In a section of this book, the authors conducted an experiment where they made physical models of the human vocal tract based on their measurements and produced vowels from those models. As a result, they confirmed...

View on splab.net
