Mistral Releases Its First Open-Source Speech Model, Voxtral — A Complete Game-Changer Surpassing Whisper

AI Daily News updated 1w ago dongdong
15 0

Mistral AI has released its first open-source speech model, Voxtral, available in 24B and 3B parameter versions. Open-sourced under the Apache 2.0 license, Voxtral also provides API access. It supports eight major languages and can handle 30-minute audio transcription or 40-minute semantic understanding tasks. Outperforming Whisper across the board, Voxtral excels in multilingual benchmark tests, ranks first in speech translation, and matches GPT-4o-mini in speech understanding capabilities.

© Copyright Notice

Related Posts

No comments yet...

none
No comments yet...