Your search

  • Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

  • WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Last update from database: 28/05/2025, 04:10 (UTC)

Explore