Mistral Releases Its First Open-Source Speech Model, Voxtral — A Complete Game-Changer Surpassing Whisper

AI Daily News updated 4m ago dongdong

Mistral AI has released Voxtral, its first open-source speech model, in 24B and 3B parameter versions. Both versions are released under the Apache 2.0 license, and the model is also available through an API. Voxtral supports eight major languages and can transcribe audio up to 30 minutes long or handle semantic understanding tasks on audio up to 40 minutes long. It is reported to outperform Whisper across the board: it leads in multilingual benchmark tests, ranks first in speech translation, and matches GPT-4o-mini in speech understanding.

© Copyright Notice

The copyright of the article belongs to the author. Please do not reprint without permission.
