Full catalogue
AudioSet
Resource type
Title
AudioSet
Abstract
A sound vocabulary and dataset
AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.
By releasing AudioSet, we hope to provide a common, realistic-scale evaluation task for audio event detection, as well as a starting point for a comprehensive vocabulary of sound events.
Accessed
22/11/2024, 14:03
Citation
AudioSet. (n.d.). Retrieved November 22, 2024, from https://research.google.com/audioset/index.html
Audio
Link to this record