Full catalogue
Real-time speech MRI datasets with corresponding articulator ground-truth segmentations
Resource type
Authors/contributors
- Ruthven, Matthieu (Author)
- Peplinski, Agnieszka M. (Author)
- Adams, David M. (Author)
- King, Andrew P. (Author)
- Miquel, Marc Eric (Author)
Title
Real-time speech MRI datasets with corresponding articulator ground-truth segmentations
Abstract
Abstract
The use of real-time magnetic resonance imaging (rt-MRI) of speech is increasing in clinical practice and speech science research. Analysis of such images often requires segmentation of articulators and the vocal tract, and the community is turning to deep-learning-based methods to perform this segmentation. While there are publicly available rt-MRI datasets of speech, these do not include ground-truth (GT) segmentations, a key requirement for the development of deep-learning-based segmentation methods. To begin to address this barrier, this work presents rt-MRI speech datasets of five healthy adult volunteers with corresponding GT segmentations and velopharyngeal closure patterns. The images were acquired using standard clinical MRI scanners, coils and sequences to facilitate acquisition of similar images in other centres. The datasets include manually created GT segmentations of six anatomical features including the tongue, soft palate and vocal tract. In addition, this work makes code and instructions to implement a current state-of-the-art deep-learning-based method to segment rt-MRI speech datasets publicly available, thus providing the community and others with a starting point for developing such methods.
Publication
Scientific Data
Volume
10
Issue
1
Pages
860
Date
2023-12-02
Journal Abbr
Sci Data
Language
en
ISSN
2052-4463
Accessed
25/04/2025, 18:44
Library Catalog
DOI.org (Crossref)
Citation
Ruthven, M., Peplinski, A. M., Adams, D. M., King, A. P., & Miquel, M. E. (2023). Real-time speech MRI datasets with corresponding articulator ground-truth segmentations. Scientific Data, 10(1), 860. https://doi.org/10.1038/s41597-023-02766-z
Software, Processing & Utilities
Speech Production & Articulation
Vocal Anatomy
Link to this record