MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

Resource type

Authors/contributors

Müller, Nicolas M. (Author)
Kawa, Piotr (Author)
Choong, Wei Herng (Author)
Casanova, Edresson (Author)
Gölge, Eren (Author)
Müller, Thorsten (Author)
Syga, Piotr (Author)
Sperl, Philip (Author)
Böttinger, Konstantin (Author)

Title

Abstract

We present the MLAAD dataset, which is a multi-language dataset for the task of audio anti-spoofing. This dataset has been created using a diverse set of text-to-speech (TTS) models, and is designed to evaluate the out-of-domain generalization of anti-spoofing systems, both with respect to new languages, as well as new TTS models. Specifically, MLAAD comprises: 678.3 hours of synthetic voice, in 51 different languages, created with 140 TTS models, comprising 78 different architectures. The dataset is supposed to be used in conjunction with the M-AILABS dataset . MLAAD provides only the synthetic audio, while M-AILABS provides the real audio.

Citation Key

_bh

URL

https://huggingface.co/datasets/mueller91/MLAAD

Citation

Müller, N. M., Kawa, P., Choong, W. H., Casanova, E., Gölge, E., Müller, T., Syga, P., Sperl, P., & Böttinger, K. (n.d.). MLAAD: The Multi-Language Audio Anti-Spoofing Dataset [Dataset]. Retrieved https://huggingface.co/datasets/mueller91/MLAAD

Audio Data

Synthetic Speech