Brian Richards

Brian Richards

Immersive Media Maker
Vois

What's great

Creating with Text to Speech software, even with the best AI tools like ElevenLabs is a highly iterative process, trying out voices, editing scripts for pacing and pronunciation, etc. etc., especially if you need multiple speakers for a podcast, or an audiobook or a radio play with multiple characters. Vois.so supports multiple speakers with automatic recognition when importing scripts. Very few TTS systems do that at present, combine that with it’s killer feature, it runs locally on your computer and does not operate on a token or time basis, just a very reasonable monthly or annual fixed cost. So however many interactions, generations or how much text you throw at it the cost is the same. This is new software, with a responsive developer who is actively supporting users and with considerable plans to build on a strong foundation. For a monthly cost at a fraction of its competitors this is well worth trying, there is even a free version with unlimited text and 10 generations a day, and full audio export for just $5 a go. I have been using this software for a few weeks now, and can highly recommend it.

What needs improvement

Speed of generation on some Windows systems needs GPU acceleration and does not yet compare with performance on Apple systems. Although there are a wide range of languages and styles there is room for more, and for customization

vs Alternatives

A responsive developer who is actively supporting users, and with considerable plans to build on a strong foundation. For the annual price close to the monthly cost of its competitors this is well worth considering.

Ratings
Ease of use
Reliability
Value for money
Customization
45 views