Mistral debuts Voxtral Transcribe 2, a family of speech-to-text models with speaker diarization and ultra-low latency, under the Apache 2.0 open-weight license
AI assistants are going voice-first, and Mistral AI just launched its models to compete. — On Wednesday, the French AI startup …
Voxtral Realtime is built for voice agents and live applications. Its natively streaming architecture delivers latency configurable to sub-200ms. And at 480ms, it stays within 1-2% WER of our offline model. We release the model as open weights under Apache 2.0. [image]
The demo on https://huggingface.co/... is worth a try - ignore the “No microphone found” message, clicking “Record” and allowing your browser to use a microphone fixes that. It transcribes very accurately in almost real-time. It's really impressive.