
Tags
  • Sinhala ASR training data set containing ~185K utterances of transcribed Sinhala audio. The data set consists of wave files and a TSV file, utt_spk_text.tsv, which lists for each audio file a FileID, an anonymized UserID, and the transcription. The data set has been manually quality checked, but it may still contain errors.
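
As a rough illustration of how the TSV file described above might be read, here is a minimal Python sketch. It assumes that utt_spk_text.tsv is UTF-8 encoded, has no header row, and stores its three fields in the order FileID, UserID, transcription; the listing does not confirm any of these details. The names `Utterance` and `read_utterances` are hypothetical, and the sample rows are made up for the demo.

```python
import csv
import io
from typing import Iterator, NamedTuple, TextIO


class Utterance(NamedTuple):
    """One row of utt_spk_text.tsv (column order is an assumption)."""
    file_id: str
    user_id: str
    text: str


def read_utterances(handle: TextIO) -> Iterator[Utterance]:
    """Yield one Utterance per well-formed row of a tab-separated stream."""
    # QUOTE_NONE keeps quote characters inside transcriptions literal.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # The listing warns that errors may remain, so skip rows that
        # do not have exactly three fields instead of failing.
        if len(row) != 3:
            continue
        yield Utterance(*(field.strip() for field in row))


if __name__ == "__main__":
    # Made-up sample rows; real FileID and UserID values will differ.
    sample = "abc123\tuser01\tආයුබෝවන්\nmalformed line\nxyz789\tuser02\tආයුබෝවන්\n"
    for utt in read_utterances(io.StringIO(sample)):
        print(utt.file_id, utt.user_id, utt.text)
```

To read the real file, the same function could be called on `open("utt_spk_text.tsv", encoding="utf-8", newline="")`. Skipping malformed rows silently is a design choice for the sketch; counting or logging them may be preferable when auditing the data.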

Last update from database: 19/05/2026, 04:10 (UTC)
