Results: 20 resources
- BAGLS-RT is an extension of the BAGLS dataset (DOI 10.5281/zenodo.3762320) intended for (re-)training glottis segmentation models.
- BAGLS is a benchmark dataset intended to compare performance across automatic glottis segmentation methods.
- The frequencies, magnitudes, and bandwidths of vocal tract resonances are all important in understanding and synthesizing speech. High-precision acoustic impedance spectra of the vocal tracts of 10 subjects were measured from 10 Hz to 4.2 kHz by injecting a broadband acoustic signal through the lips. Between 300 Hz and 4 kHz, the acoustic resonances R (impedance minima measured through the...
- A zip archive of several series of DICOM files from two ex-vivo hyoid specimens: one adult and one child. Each specimen was scanned at different slice thicknesses, as described and used in Cotter et al., 2015.
- A zip archive of several series of DICOM files from three ex-vivo mandible specimens: two adult and one child. Each specimen was scanned at different slice thicknesses, as described and used in Whyms et al., 2013.
- This dataset contains simultaneous recordings of electroglottography (EGG, recorded with a Glottal Enterprises EG2-PCX2), unfiltered audio, and intraoral pressure (recorded with a Glottal Enterprises PG-60) from 14 subjects. It is meant to facilitate the validation of physical models of glottal control during voicing, in which the glottal/source waveform for speech is controlled by a combination of...
- A detailed understanding of how the acoustic patterns of speech sounds are generated by the complex 3D shapes of the vocal tract is a major goal in speech research. The Dresden Vocal Tract Dataset (DVTD) presented here contains geometric and (aero)acoustic data of the vocal tract of 22 German speech sounds (16 vowels, 5 fricatives, 1 lateral), each from one male and one...
- The currently available dataset consists of DICOM data files and corresponding sound samples for all of the Finnish vowels. Some derivatives obtained from the image and sound data are also provided, including surface models of the vowels.
- The use of real-time magnetic resonance imaging (rt-MRI) of speech is increasing in clinical practice and speech science research. Analysis of such images often requires segmentation of articulators and the vocal tract, and the community is turning to deep-learning-based methods to perform this segmentation. While there are publicly available rt-MRI datasets of speech,...
- The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus...
- Welcome to our interactive International Phonetic Association (IPA) chart website! Clicking on the IPA symbols on our charts will allow you to listen to their sounds and see vocal-organ movements imaged with ultrasound, MRI, or in animated form. To find out more about how our IPA charts were made, click on the buttons on the left-hand side of this page. The website contains two main...
- This is a corpus of articulatory data of different forms (EMA, MRI, video, 3D scans of the upper/lower jaw, audio, etc.) acquired from one male British English speaker.
- The USC Speech and Vocal Tract Morphology MRI Database consists of real-time magnetic resonance images of dynamic vocal tract shaping during read and spontaneous speech with concurrently recorded denoised audio, and 3D volumetric MRI of vocal tract shapes during vowels and continuant consonants sustained for 7 seconds, from 17 speakers.
- USC-EMO-MRI is an emotional speech production database which includes real-time magnetic resonance imaging data with synchronized speech audio from five male and five female actors, each producing a passage and a set of sentences in multiple repetitions, while enacting four different target emotions (neutral, happy, angry, sad). The database includes emotion quality evaluation from at least...
Explore
Audio Data
Derived & Measured Data
- Formant Measurements (1)
- Vocal Tract (4)
Software, Processing & Utilities
Speech Production Data
Vocal Anatomy
- Hyoid (1)
- Larynx and Glottis (3)
- Mandible and Maxilla (3)
- Mechanical Properties (1)
- Models (2)
- Vocal Tract (11)
  - Articulography (2)
  - MRI (10)
  - Ultrasound (1)
  - Video (1)
Teaching Resources
- 3D Models (1)
- Articulation Data (2)
Tags
- adult (11)
- MRI (9)
- audio data (9)
- male (8)
- real-time MRI (rtMRI) (8)
- female (7)
- read speech (7)
- English (6)
- articulatory data (5)
- volumetric MRI (5)
- segmentation (4)
- teaching resource (3)
- vowels (3)
- speech production (3)
- vocal tract shape (3)
- mandible (3)
- video (3)
- DICOM (3)
- computed tomography (CT) (3)
- source-filter model (2)
- tube model (2)
- STL files (2)
- articulation (2)
- multimodal (2)
- International Phonetic Alphabet (IPA) (2)
- electromagnetic articulography (EMA) (2)
- child (2)
- benchmark (2)
- glottis (2)
- videoendoscopy (2)
- MATLAB (1)
- area function (1)
- numerical acoustic modelling (1)
- vocal fold model (1)
- 3D print (1)
- phonetics (1)
- American English (1)
- angry (1)
- emotional speech (1)
- happy (1)
- perceptually annotated (1)
- sad (1)
- consonants (1)
- British (1)
- dentition (1)
- maxilla (1)
- lip video (1)
- ultrasound tongue imaging (UTI) (1)
- French (1)
- transcribed (1)
- finite element method (FEM) (1)
- electroglottography (EGG) (1)
- intraoral pressure (1)
- validation (1)
- hyoid (1)
- antiresonance (1)
- vocal tract length (1)
- vocal tract resonance (1)
Resource type
- Dataset (13)
- Journal Article (3)
- Software (2)
- Web Page (2)