MLAAD: The Multi-Language Audio Anti-Spoofing Dataset
Resource type
Authors/contributors
- Müller, Nicolas M. (Author)
- Kawa, Piotr (Author)
- Choong, Wei Herng (Author)
- Casanova, Edresson (Author)
- Gölge, Eren (Author)
- Müller, Thorsten (Author)
- Syga, Piotr (Author)
- Sperl, Philip (Author)
- Böttinger, Konstantin (Author)
Title
MLAAD: The Multi-Language Audio Anti-Spoofing Dataset
Abstract
We present the MLAAD dataset, which is a multi-language dataset for the task of audio anti-spoofing. This dataset has been created using a diverse set of text-to-speech (TTS) models, and is designed to evaluate the out-of-domain generalization of anti-spoofing systems, both with respect to new languages, as well as new TTS models. Specifically, MLAAD comprises:
678.3 hours of synthetic voice,
in 51 different languages,
created with 140 TTS models, comprising 78 different architectures.
The dataset is supposed to be used in conjunction with the M-AILABS dataset . MLAAD provides only the synthetic audio, while M-AILABS provides the real audio.
Citation Key
_bh
Citation
Müller, N. M., Kawa, P., Choong, W. H., Casanova, E., Gölge, E., Müller, T., Syga, P., Sperl, P., & Böttinger, K. (n.d.). MLAAD: The Multi-Language Audio Anti-Spoofing Dataset [Dataset]. Retrieved https://huggingface.co/datasets/mueller91/MLAAD
Audio Data
Link to this record
Relations