Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Discover similar tools to enhance your workflow
Polymath uses machine learning to convert any music library (e.g from Hard-Drive or YouTube) into...
GPU Everything. Run anything Dockerized. Run autoscale Inference. Save costs 50-90%.
AI-powered search to find code by searching for what it does, not just what it is. Once you find...