Whisper is a general-purpose speech recognition model. Trained on a large dataset of diverse audio, it is a multi-task model that can perform multilingual speech recognition, speech translation, and language identification.
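As a rough illustration of those three tasks, here is a minimal sketch that assumes the open-source `openai-whisper` Python package (with ffmpeg installed), which this listing does not itself name. The helper `run_whisper`, the `VALID_TASKS` set, and the default model size `"base"` are hypothetical choices made for the example.

```python
VALID_TASKS = {"transcribe", "translate"}


def run_whisper(audio_path, task="transcribe", model_name="base"):
    """Run Whisper on one audio file and return the detected language and text.

    task="transcribe" keeps the spoken language; task="translate" produces
    English text. The language is identified by the model when not specified.
    """
    if task not in VALID_TASKS:
        raise ValueError(f"task must be one of {sorted(VALID_TASKS)}, got {task!r}")

    # Imported lazily so the argument check above works without the package.
    # Assumption: installed via `pip install openai-whisper`.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, task=task)

    # The result dictionary carries the identified language code and the text.
    return {"language": result["language"], "text": result["text"].strip()}
```

Calling `run_whisper("audio.mp3", task="translate")` on a file of your own (the filename is a placeholder) would return a dictionary holding the language code Whisper identified and the English translation.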