Praney Behl

Vois - Studio-quality text-to-speech and voice cloning, fully local

Vois is a desktop voice studio for turning scripts, ebooks, articles, and podcasts into natural audio with 63 voices, voice cloning, and pro editing — no uploads, no per-character fees, no usage caps. Cloud voice tools charge per character, cap usage, and upload your scripts. Vois gives you studio-quality speech, voice cloning, and editing fully on your laptop or desktop.

Praney Behl
Maker
Hey Product Hunt! 👋 I'm Praney, and I built Vois as a solo maker over the past year.

The backstory is personal. I'm partially dyslexic, and long text has always been a struggle for me. Since high school, I've been converting articles, reports, academic papers, and white papers to audio so I could actually absorb them. I built a tool for myself that did exactly that.

When I showed it to others, something unexpected happened. A friend wanted it for creating custom bedtime stories for her kids. Another had a stack of ebooks he'd bought but never read, and he wanted to convert them to audio for his commute. Others, like me, had ADHD or dyslexia and immediately got it. That personal tool evolved into Vois, a full desktop voice AI studio.

I also always wanted to create my own podcast but never felt my voice was good enough and didn't want to deal with the complexity of editing. Vois gives me that out of the box.

What it does:
→ 63 studio-quality voices across 15 character archetypes
→ 3 TTS engines (fast drafts, expressive English, 23-language multilingual)
→ Voice cloning from a short audio sample
→ Script editor with multi-speaker dialogue
→ Multi-track timeline for mixing and arranging
→ Professional mastering (LUFS normalization, de-esser, EQ, limiter)
→ Smart caching: edit one sentence, only that chunk regenerates
→ Export to WAV, MP3, FLAC, AAC

Everything runs on your machine. Nothing gets uploaded. No per-character costs. No usage limits. And unlike cloud services, you don't pay credits just to hear how a change sounds: Vois caches everything, so iterations are instant.

The tech: Native Rust backend. No Python, no Docker. The fast engine generates at 6x real-time on Apple Silicon.

Pricing: $29/month or $9/month on the annual plan. Free tier gives you 10 generations per day with full access to all voices and engines, with no feature gating.

🚀 Launch offer for Product Hunt: 40% off the annual plan, $65/year ($5.40/mo). Code: PRODUCTHUNT. Valid until March 9.
→ https://vois.so/checkout?plan=ye...

I'd love to hear how you'd use something like this, whether it's accessibility, content creation, game dialogue, or something I haven't thought of yet. I'll be here all day answering every question. 🙏
- Praney
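
For readers wondering what "edit one sentence, only that chunk regenerates" could look like, here is a minimal sketch. It is not Vois's implementation (the post describes a native Rust backend and gives no internals). It assumes sentence-level chunks cached under a hash of the voice name plus the sentence text. `split_sentences`, `chunk_key`, `ChunkCache`, and `fake_tts` are hypothetical names used only for illustration.

```python
import hashlib
import re


def split_sentences(script):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", script.strip()) if s]


def chunk_key(text, voice):
    """Cache key (assumption): SHA-256 over the voice name and sentence text."""
    return hashlib.sha256(f"{voice}\x00{text}".encode("utf-8")).hexdigest()


class ChunkCache:
    """Keeps rendered audio per sentence so unchanged sentences are reused."""

    def __init__(self, synthesize):
        self._synthesize = synthesize  # callable(text, voice) -> bytes
        self._audio = {}               # chunk key -> rendered audio bytes
        self.misses = 0                # number of times synthesis actually ran

    def render(self, script, voice):
        clips = []
        for sentence in split_sentences(script):
            key = chunk_key(sentence, voice)
            if key not in self._audio:
                # Only new or edited sentences reach the (slow) engine.
                self._audio[key] = self._synthesize(sentence, voice)
                self.misses += 1
            clips.append(self._audio[key])
        return clips


def fake_tts(text, voice):
    """Stand-in for a real TTS engine; returns placeholder bytes."""
    return f"[{voice}] {text}".encode("utf-8")


cache = ChunkCache(fake_tts)
cache.render("Hello there. This is a draft. Goodbye.", "narrator")
first_pass = cache.misses
clips = cache.render("Hello there. This is the final take. Goodbye.", "narrator")
second_pass = cache.misses - first_pass
print(first_pass, second_pass)
```

In this sketch the first render synthesizes all three sentences, and the second render synthesizes only the edited middle sentence. Including the voice in the key means that switching voices regenerates every chunk, which is one plausible design choice among several.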
Kimberly Ross

@praney_behl Hi Praney. Congrats on the launch. What datasets were used to train the voice models?

Praney Behl

@kimberly_ross Thanks! Great question. The TTS engines use models trained on publicly available speech datasets commonly used in speech synthesis research: clean, studio-quality speech corpora.

The 63+ production voices in the library were created using voice design techniques (generating voice characteristics from text descriptions) - they're not clones of real people.

For the voice cloning feature, the app requires users to confirm they have the voice owner's explicit consent before processing.

Happy to go deeper on any of this!

Abhinav Ramesh

Super! Your backstory is inspiring, and congrats on the launch. Will give it a shot and let you know my feedback :)

Praney Behl

@abhinavramesh Thanks Abhinav, I look forward to it. I hope you enjoy trying the app as much as I enjoyed building it.

Clement Ozemoya

Sounds amazing, this could tie into my voice server for Claude Code.

One question, does it support African voices and intonations? Big gap here industry-wide.

Praney Behl

@clement_ozemoya Absolutely, we are launching an agent skill and an accompanying vois-cli for programmatic access soon.

Elvis Bueno

The no uploads angle is the one that would actually sell me — I never loved the idea of sending scripts to a cloud service just to get audio back. How does the voice quality hold up on longer form content like a full chapter of an ebook? That's usually where these tools start to sound robotic.

Praney Behl

@zerodarkhub Nice. You can go virtually as long as you want. Vois has a built-in optimization and memory management module that keeps everything in check and functional. The scripts feature lets you split chapters individually, not only for management but also for export control. Ultimately, export time comes down to how powerful the computer running Vois is, but Vois has been optimized to run at decent speeds even on older computers. In fact, the narration in all the Vois tutorials and demo videos was created within Vois itself, and so were the background music and effects. Vois is free to try!

https://vois.so/tutorials