Aleksandar Blazhev

Voxtral Transcribe 2 by Mistral - Real-time speech-to-text with speaker diarization

byβ€’
Voxtral Transcribe 2 delivers ultra-fast, highly accurate speech-to-text with real-time transcription and speaker diarization. Built for live apps, voice agents, and meetings, it supports 13 languages, word-level timestamps, and privacy-first deployment . All at industry-leading speed and cost.

Add a comment

Replies

Best
Aleksandar Blazhev
Hunter
πŸ“Œ
Hey everyone πŸ‘‹ Excited to share Voxtral Transcribe 2! Ultra-fast speech-to-text with real-time transcription and speaker diarization. Built for voice agents, meetings, and live apps, with sub-200ms latency, high accuracy, and strong multilingual support.
Kimberly Ross

@byalexaiΒ Congrats on the launch! How do you balance openness like open weights, or edge deployment, with ensuring quality, safety, and consistency for enterprise customers?

Nika

That moment when Mistral thinks faster than I speak. :D

Wilco Kruijer

It was definitely time for Mistral to launch something SOTA! Awesome.

Samet Sezer

at $0.003/min, this basically kills the margin for a lot of transcription wrappers. curious if the "diarization" actually handles people talking over each other (cross-talk), or if it still gets confused?

Laiba Danish

Real-time transcription with speaker diarization is a game-changer for meetings and interviews. Does it support multiple languages and export to editable formats like Word or Google Docs?

Mykyta Semenov πŸ‡ΊπŸ‡¦πŸ‡³πŸ‡±

Awesome! The speed is really great.

Tugay Pala

speaker diarization is always tricky. how does it perform with overlapping speech? and whats the latency for real-time use?

Tugay Pala

Speaker diarization is the feature that separates good transcription from great transcription! How many speakers can it reliably distinguish? And does the real-time aspect work well for live meetings or is there noticeable latency? Mistral's been shipping quality models consistently.