Results | YorVoice Catalogue

The Fake-or-Real (FoR) dataset is a collection of more than 195,000 utterances of real human speech and computer-generated speech. The dataset can be used to train classifiers to detect synthetic speech. It aggregates data from the latest TTS solutions (such as Deep Voice 3 and Google WaveNet TTS) as well as a variety of real human speech, including the Arctic Dataset...

View on bil.eecs.yorku.ca

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

Nicolas M. Müller, Piotr Kawa, Wei Herng Choong + 6 others

We present MLAAD, a multi-language dataset for the task of audio anti-spoofing. The dataset was created using a diverse set of text-to-speech (TTS) models and is designed to evaluate the out-of-domain generalization of anti-spoofing systems with respect to both new languages and new TTS models. Specifically, MLAAD comprises: 678.3 hours of synthetic...

View on huggingface.co

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Haibin Wu, Yuan Tseng, Hung-yi Lee

TL;DR: We show that better detection of deepfake speech from codec-based TTS systems can be achieved by training models on speech re-synthesized with neural audio codecs. This dataset is released for this purpose. See our paper and GitHub for more details on using our...

View on huggingface.co

In-the-Wild: A Deepfake Detection Dataset

Nicolas M. Müller, Pavel Czempin, Franziska Dieckmann + 2 others

The In-the-Wild dataset contains real and synthetic speech recordings of 58 celebrities and politicians, collected from online videos. It provides a realistic benchmark for testing how well audio deepfake detection models generalize beyond laboratory data such as ASVspoof.

Task: Audio Classification (Deepfake / Genuine)
Languages: English
Modality: Audio
Size: 37.9 hours total (17.2 hours fake, 20.7 hours real)

View on huggingface.co

SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods

Wen Huang, Yanmei Gu, Zhiming Wang + 2 others

SpeechFake is a large-scale multilingual dataset for speech deepfake detection, featuring over 3 million fake samples across 46 languages. Generated using 30 diverse open-source models spanning text-to-speech (TTS), voice conversion or cloning (VC), and neural vocoder (NV) methods, it offers rich metadata and strong coverage of modern generation techniques, enabling robust and generalizable detection research.

View on github.com

ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech

Xin Wang, Héctor Delgado, Hemlata Tak + 26 others

This is the Zenodo repository for the ASVspoof 5 database. ASVspoof 5 is the fifth edition in a series of challenges which promote the study of speech spoofing and deepfake attacks, and the design of detection solutions. Compared to previous challenges, the ASVspoof 5 database is built from crowdsourced data collected from around 2,000 speakers in diverse acoustic conditions. More than 20...

View on zenodo.org

PartialSpoof Database - Partially Spoofed Audio Dataset for Anti-spoofing

Lin Zhang, Xin Wang, Erica Cooper + 3 others

All existing databases of spoofed speech contain attack data that is spoofed in its entirety. In practice, it is entirely plausible that successful attacks can be mounted with utterances that are only partially spoofed. By definition, partially-spoofed utterances contain a mix of both spoofed and bona fide segments, which will likely degrade the performance of countermeasures trained with...

View on zenodo.org

ASVspoof

The automatic speaker verification spoofing and countermeasures (ASVspoof) challenge series is a community-led initiative which aims to promote the consideration of spoofing and deepfakes and the development of countermeasures.

View on www.asvspoof.org

Results 8 resources