Hey all I'm one of the co-creators of an open-source inference stack we ve been developing as part of our work at United We Care, a mental health and AI infra company.
We ve tested this stack against Whisper (OpenAI), ElevenLabs, NVIDIA, and Meta s public models and it consistently outperforms on:
WER (Speech-to-text): 6.2% on noisy, multi-accent corpora