Marek Serafin

Spoke - Private voice-to-text for macOS. Hold a key, speak, done.

Spoke is a macOS app that transcribes your voice into any text field. It runs a local speech model, so no audio leaves your device. Hold a keyboard shortcut, speak, and the text appears wherever your cursor is. Optionally connect an AI provider to process transcriptions on the fly.

Marek Serafin
Maker

Hey everyone! I built Spoke because every dictation app I tried was overloaded with features I didn't need. I just wanted something simple: hold a key, talk, text appears. No menus to navigate, no modes to pick, no friction.
Spoke runs a local speech model so nothing leaves your Mac, but honestly the main goal was speed and simplicity. It works in any text field: Slack, iMessage, Claude, terminal, whatever you have focused. Hold the shortcut, speak, let go. That's it.
If you want more, you can plug in your own API key for OpenAI, Anthropic, or Gemini to process your speech on the fly. But the core experience is just fast, no-nonsense voice to text.
My favorite feature is auto-return: it presses Enter after you finish speaking, so in chat apps your message just sends. Feels like a walkie-talkie.
Happy to answer any questions about how it works or what's next.

I have a license code, limited to 50 uses:

0B22B577-26C7-467E-A4A7-E05F990F7299

No rush though. With how crowded this space is, those 50 codes are probably good for the lifetime of the app 😂

Douglas Evaristo

Hey @stoprocent, really nice work on this! The product identity and design look great, and I think that matters a lot in a space that's getting pretty crowded with similar tools.

I also like the business model. The one-time purchase is refreshing compared to the endless subscriptions we usually see.

I've been testing it and the experience feels really simple and fast, which is exactly what you described.

The only thing that made me a bit sad is that Portuguese isn't supported yet. It's my native language and I still use it a lot 😅 Hopefully it can be added in the future!!

Marek Serafin

Thanks @douglasevaristo, that really means a lot. Design and simplicity were the things I obsessed over most, so I'm glad it comes through!

Portuguese is definitely on the list. The underlying Parakeet model has decent multilingual coverage and I've been testing it. The main challenge is making sure quality is consistent enough before I officially call it supported. I'll reach out when it's ready for some native-speaker testing, if you're up for it.

Tony Shishov

Nice launch!

When I open the app, I miss a tutorial on how to use LLM formatting or translation. If possible, could you please add that information to the onboarding?

Marek Serafin

@tony_shishov Sure thing, I will record a demo video on this.

Nicholas Greenweck
🧐 Good find

Is this coming to Windows and iOS? In the very short time I've used Spoke, it's my favorite transcription app.

Also, your purchase page is in test mode, so it appears I'm unable to buy a license.

Marek Serafin

@nicholas_greenweck That means a lot, thank you! 🙏

And sorry about the checkout link. I was so focused on shipping the app that I completely forgot to test the payment flow 😅

The correct link is here: https://buy.usespoke.app/checkout/buy/22707bf3-6bef-4804-a6c2-12f971677f78


Use code UYNJEYOQ for 50% off, valid until Wednesday!

To your question: yes, iOS is in the works! My plan is to have the Mac and iOS versions complement each other rather than just be ports, so they work together as a system. More details coming soon. Stay tuned!

Shaun Hurley

I love how clean/simple this is. Brilliant! What I really appreciate is that I can continue letting my music play while I dictate. Any plans on eventually setting this up so the text appears as it's being spoken (instead of at the end)? Or maybe that makes it more complicated and slows things down? Either way, thanks for sharing this. I'm really enjoying it!

Marek Serafin

Thanks @shaun_hurley, really glad it's working well for you.

On real-time word appearance: great question, and the short answer is that it's a deliberate tradeoff rather than a missing feature. Spoke uses an attention-based (non-streaming) model. It processes audio over a sliding context window and resolves word boundaries, punctuation, and meaning after hearing enough context. That's actually why the accuracy is so high: the model can "look back" and correct itself as you keep speaking. If I forced word-by-word output, you'd get a lot of jitter and corrections mid-sentence, which ends up feeling worse in practice. Think of it like autocorrect that only kicks in once you finish typing a word: the full context is what makes it smart. It's on my radar to explore partial output for longer dictations, but it won't be true streaming in the Whisper-style sense.
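
For anyone curious what "a sliding context window" can look like in code, here is a rough Python sketch. It is not Spoke's or Parakeet's actual pipeline (the thread doesn't describe those internals); the function name `sliding_windows` and the `window` and `hop` parameters are made up for illustration. It only shows how overlapping chunks give each stretch of audio some surrounding context before anything is decoded.

```python
from typing import Iterator, Sequence, Tuple


def sliding_windows(
    samples: Sequence[float], window: int, hop: int
) -> Iterator[Tuple[int, Sequence[float]]]:
    """Yield (start_index, chunk) pairs that cover `samples`.

    Consecutive chunks overlap by `window - hop` samples when hop < window,
    so audio near a chunk edge is also seen with more context in a
    neighbouring chunk. Hypothetical helper, not a real library API.
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    if len(samples) == 0:
        return
    start = 0
    while True:
        yield start, samples[start:start + window]
        # Stop once the current chunk reaches the end of the audio.
        if start + window >= len(samples):
            break
        start += hop


if __name__ == "__main__":
    audio = list(range(10))  # stand-in for real audio samples
    for start, chunk in sliding_windows(audio, window=4, hop=2):
        print(start, chunk)
```

A decoder that only commits text after seeing a whole window (or several) can settle word boundaries with that extra context, which is the tradeoff described above: later output, fewer mid-sentence corrections.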

Brianna Lin

Nice! How's the accuracy with the on-device model, especially for punctuation and formatting?

Marek Serafin

@brianna_lin Accuracy is genuinely the thing I'm most proud of. Parakeet (the model Spoke runs locally) is one of the top-performing English ASR models available right now, and it competes well with cloud-based alternatives on standard benchmarks. Punctuation is solid for natural speech; it handles sentence boundaries well if you speak at a normal pace. Formatting is more contextual: things like lists or code won't auto-format since that's not what a speech model does, but if you connect an AI provider you can add a prompt to handle that on the fly. For everyday dictation (messages, notes, emails) it just works cleanly out of the box.
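
As a rough illustration of the "add a prompt" step, here is a small Python sketch. Everything in it is an assumption for the sake of the example: `postprocess`, `complete`, and `fake_provider` are hypothetical names, and `complete` stands in for whatever call your AI provider's SDK actually exposes. It is not how Spoke wires this up internally.

```python
from typing import Callable


def postprocess(
    transcript: str, instruction: str, complete: Callable[[str], str]
) -> str:
    """Apply a user-written instruction to a raw transcript.

    `complete` is any function that takes a prompt string and returns the
    provider's text reply (hypothetical; swap in a real SDK call).
    Falls back to the raw transcript if there is no instruction, the
    provider fails, or the reply is empty.
    """
    if not instruction.strip():
        return transcript
    prompt = f"{instruction.strip()}\n\nTranscript:\n{transcript}"
    try:
        reply = complete(prompt)
    except Exception:
        # Dictation should still produce text when the provider is down.
        return transcript
    return reply.strip() or transcript


def fake_provider(prompt: str) -> str:
    """Offline stand-in for a provider: turns the transcript into bullets."""
    text = prompt.split("Transcript:\n", 1)[1]
    return "\n".join(f"- {part.strip()}" for part in text.split(",") if part.strip())


if __name__ == "__main__":
    raw = "buy milk, call the bank, book flights"
    print(postprocess(raw, "Format this as a bullet list.", fake_provider))
```

The fallback matters for a dictation tool: if the provider call fails, you still get the plain transcript rather than nothing.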