Launched this week
Spoke
Private voice-to-text for macOS. Hold a key, speak, done.
98 followers
Private voice-to-text for macOS. Hold a key, speak, done.
98 followers
Spoke is a macOS app that transcribes your voice into any text field. It runs a local speech model β no audio leaves your device. Hold a keyboard shortcut, speak, and the text appears wherever your cursor is. Optionally connect an AI provider to process transcriptions on the fly.






Hey everyone! I built Spoke because every dictation app I tried was overloaded with features I didn't need. I just wanted something simple β hold a key, talk, text appears. No menus to navigate, no modes to pick, no friction.
Spoke runs a local speech model so nothing leaves your Mac, but honestly the main goal was speed and simplicity. It works in any text field β Slack, iMessage, Claude, terminal, whatever you have focused. Hold the shortcut, speak, let go. That's it.
If you want more, you can plug in your own API key for OpenAI, Anthropic, or Gemini to process your speech on the fly. But the core experience is just fast, no-nonsense voice to text.
My favorite feature is auto-return β it presses Enter after you finish speaking, so in chat apps your message just sends. Feels like a walkie-talkie.
Happy to answer any questions about how it works or what's next.
I have a license code, limited to 50 uses:
No rush though β with how crowded this space is, those 50 codes are probably good for the lifetime of the app π
Hey @stoprocent, reeally nice work on this! The product identity and design look great and I think that matters a lot in a space that's getting pretty crowded with similar tools.
I also like the business model. The one-time purchase is refreshing compared to the endless subscriptions we usually see.
Iβve been testing it and the experience feels really simple and fast, which is exactly what you described.
The only thing that made me a bit sad is that Portuguese isnβt supported yet, itβs my native language and I still use it a lot π Hopefully it can be added in the future!!
Thanks @douglasevaristo, that really means a lot β design and simplicity were the things I obsessed over most, so glad it comes through!
Portuguese is definitely on the list. The underlying Parakeet model has decent multilingual coverage and I've been testing it β the main challenge is making sure quality is consistent enough before I officially call it supported. I'll reach out when it's ready for some native-speaker testing if you're up for it
Is this coming to Windows and iOS? In the very short time I've used Spoke, it's my favorite transcription app.
Also, your purchase page is in test mode, so it appears I'm unable to buy a license.
@nicholas_greenweckΒ That means a lot β thank you! π
And sorry about the checkout link β I was so focused on shipping the app that I completely forgot to test the payment flow π
The correct link is here: https://buy.usespoke.app/checkout/buy/22707bf3-6bef-4804-a6c2-12f971677f78
Use code UYNJEYOQ for 50% off β valid until Wednesday!
To your question: yes, iOS is in the works! My plan is to have the Mac and iOS versions complement each other rather than just be ports β so they work together as a system. More details coming soon. Stay tuned!
Nice launch!
When I am opening the app, I miss the tutorial how to use LLM formatting or translation. If it's possible, can you please add the information into onboarding?
@tony_shishovΒ sure thing I will record some demo video on this.
I love how clean/simple this is β brilliant! What I really appreciate is that I can continue letting my music play while I dictate. Any plans on eventually setting this up so the text appears as it's being spoken (instead of at the end)? Or maybe that makes it more complicated and slows things down? Either way, thanks for sharing this. I'm really enjoying it!
Thanks @shaun_hurleyΒ , really glad it's working well for you
On real-time word appearance β great question, and the short answer is: it's a deliberate tradeoff rather than a missing feature. Spoke uses "attention-based" (non-streaming) model. It processes audio over a sliding context window and resolves word boundaries, punctuation, and meaning after hearing enough context. That's actually why the accuracy is so high β the model can "look back" and correct itself as you keep speaking. If I forced word-by-word output, you'd get a lot of jitter and corrections mid-sentence, which ends up feeling worse in practice. Think of it like autocorrect that only kicks in once you finish typing a word β the full context is what makes it smart. It's on my radar to explore partial output for longer dictations, but it won't be true streaming in the Whisper-style sense.
Copperlane
Nice! Howβs the accuracy with the on-device model, especially for punctuation and formatting?
@brianna_linΒ Accuracy is genuinely the thing I'm most proud of. Parakeet (the model Spoke runs locally) is one of the top-performing English ASR models available right now β it competes well with cloud-based alternatives on standard benchmarks. Punctuation is solid for natural speech; it handles sentence boundaries well if you speak at a normal pace. Formatting is more contextual β things like lists or code won't auto-format since that's not what a speech model does, but if you connect an AI provider you can add a prompt to handle that on the fly. For everyday dictation β messages, notes, emails β it just works cleanly out of the box.