Mistral Voxtral: Shaking Up Big Tech Voice AI at ₩0.1 Per Minute

Mistral Voxtral: Shaking Up Big Tech’s Voice AI at $0.001 Per Minute

  • Mistral AI Releases Open-Source Voice Model Voxtral
  • Higher Accuracy at Half the Price of Whisper
  • Two Versions Available: 24B and 3B

What Happened?

Mistral AI has released Voxtral, an open-source speech recognition model.[Mistral AI] The API costs $0.001 per minute, half the price of Whisper.

Small (24B) is for production, and Mini (3B) is for edge use.[Hugging Face]

Why is it Important?

It’s a price disruption in the voice AI market. Voxtral Small outperformed Gemini 2.5 Flash and GPT-4o-mini.[Slator] Supports 13 languages, including Korean. Real-time mode has latency under 200ms.

What Happens Next?

A powerful alternative has emerged in open source. OpenAI and Google’s response is noteworthy.

Frequently Asked Questions (FAQ)

Q: How is it different from Whisper?

A: Half the price, higher performance. The Whisper ecosystem is more mature.

Q: Can it be run locally?

A: Possible with Mini (3B). It has an Apache 2.0 license.

Q: Is Korean supported?

A: Included in the official 13 languages.


If you found this helpful, please subscribe to AI Digester.

References

Leave a Comment