Mistral Voxtral: Shaking Up Big Tech’s Voice AI at $0.001 Per Minute
- Mistral AI Releases Open-Source Voice Model Voxtral
- Higher Accuracy at Half the Price of Whisper
- Two Versions Available: 24B and 3B
What Happened?
Mistral AI has released Voxtral, an open-source speech recognition model.[Mistral AI] The API costs $0.001 per minute, half the price of Whisper.
Small (24B) is for production, and Mini (3B) is for edge use.[Hugging Face]
Why is it Important?
It’s a price disruption in the voice AI market. Voxtral Small outperformed Gemini 2.5 Flash and GPT-4o-mini.[Slator] Supports 13 languages, including Korean. Real-time mode has latency under 200ms.
What Happens Next?
A powerful alternative has emerged in open source. OpenAI and Google’s response is noteworthy.
Frequently Asked Questions (FAQ)
Q: How is it different from Whisper?
A: Half the price, higher performance. The Whisper ecosystem is more mature.
Q: Can it be run locally?
A: Possible with Mini (3B). It has an Apache 2.0 license.
Q: Is Korean supported?
A: Included in the official 13 languages.
If you found this helpful, please subscribe to AI Digester.