Your search
Results 15 resources
-
Explore your larynx like never before with this physics-based interactive 3D model. Experiment with various laryngeal configurations as you move, rotate, pull and shake. Developed by award winning linguistics professor and phonetician, Dr. Scott Moisik this model is anatomically accurate and includes the structures that are not represented in other models. It is an excellent introduction to...
-
Corpus phonetics has become an increasingly popular method of research in linguistic analysis. With advances in speech technology and computational power, large scale processing of speech data has become a viable technique. A fair number of researchers have exploited these methods, yet these techniques still remain elusive for many. In the words of Mark Liberman, there has been “surprisingly...
-
This website provides a tutorial on R for Linguists. The tutorial provides learners with a foundation to the R programming language using RStudio. It covers the basics of the R grammar, including how to define variables, use R as a calculator, and do basic data manipulation. Building on basic data manipulation, it then covers how to work with full datasets and obtaining summary statistics...
-
Hi – my name is Simon King and this is my personal website for supporting my teaching. I am the Professor of Speech Processing at the University of Edinburgh, where I teach courses in speech processing and speech synthesis at advanced undergraduate and Masters level. Use of this website Students: You may use this website freely for personal use. You may download copies of the content for your...
-
We have been collecting real-time MRI data from phoneticians producing the sounds of the International Phonetic Alphabet, together with standard sentences and texts. You may access the collected data by clicking on the pictures below.
-
Praat script that automatically detects syllable nuclei in order to measure speech rate without the need of a transcription. Peaks in intensity (dB) that are preceded and followed by dips in intensity are considered as potential syllable nuclei. The script subsequently discards peaks that are not voiced.
-
SpeechBox is a set of multiple speech resources. Each will be added to the YorVoice Data Catalogue as individual, searchable resources soon.
-
The MSP-Podcast corpus contains speech segments from podcast recordings which are perceptually annotated using crowdsourcing. The collection of this corpus is an ongoing process. Version 1.11 of the corpus has 151,654 speaking turns (237 hours and 56 mins). The proposed partition attempts to create speaker-independent datasets for Train, Development, Test1, Test2, and Test3 sets.
-
The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English, both spoken and written, from the late twentieth century. Access the data here: https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2554
-
The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a creative commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simulatenously recorded real-world noise. The details: -...
-
A sound vocabulary and dataset AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By...
-
Open SLR is a set of multiple speech resources. Each will be added to the YorVoice Data Catalogue as individual, searchable resources soon.
-
VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
-
The Vowel by Chiba and Kajiyama (1941-42) integrated the mechanisms of vowel production and perception from the viewpoints of physiology, physics and psychology in a single book. In a section of this book, the authors conducted an experiment where they made physical models of the human vocal tract based on their measurements and produced vowels from those models. As a result, they confirmed...
Explore
Audio
-
Accent/Region
(1)
- British English (1)
- Emotional Speech (1)
- Language (5)
- Multi-Speaker (5)
- Singing (1)
- Speech in Noise (1)
Derived & Measured Data
- Vocal Tract (1)
Software, Processing & Utilities
- Phone Apps (1)
- Speech Processing (1)
Speech Production & Articulation
- MRI (1)
Teaching Resources
- 3D Models (2)
- Articulation Data (1)
- Tutorials (2)
- Videos (2)
Vocal Anatomy
- Vocal Tract (1)
Tags
- teaching resource (4)
- audio data (4)
- English (3)
- read speech (3)
- transcribed (2)
- adult (2)
- 3D print (1)
- STL files (1)
- source-filter model (1)
- tube model (1)
- vowels (1)
- open-source (1)
- speech recognition (1)
- labelled (1)
- non-speech (1)
- singing (1)
- environmental noise (1)
- female (1)
- male (1)
- noisy audio (1)
- reverberation (1)
- British (1)
- perceptually annotated (1)
- podcast (1)
- Praat (1)
- speech rate (1)
- syllable (1)
- syllable nuclei (1)
- International Phonetic Alphabet (IPA) (1)
- phonetics (1)
- real-time MRI (rtMRI) (1)
- speech processing (1)
- speech synthesis (1)
- speech acoustics (1)
- anatomy (1)
- app (1)
- larynx (1)