Fish Audio is the most expressive and emotionally rich text-to-speech model. It generates lifelike voices that capture emotion, rhythm, and nuance with remarkable realism. Fish Audio Voice Clone recreates a natural voice from just 10 seconds of audio—preserving accent, tone, and speaking habits. Proudly built by the open-source team behind So-VITS-SVC and Bert-VITS2, giving a soul to every voice.
This is the 4th launch from Fish Audio. View more

Fish Audio S2
Launched this week
We've open-sourced Fish Audio S2, a new generation of expressive TTS that lets you direct voices with natural language. Add cues like [whisper] or [laughing nervously], generate multi-speaker dialogue in one pass, and create scary-real voices across 80+ languages.






Free
Launch Team / Built With








This is a top tier quality, it's my new favorite 🤯. And it's only a 4.5B model too 😯 incredible!
Great blog post too. I can't wait to get some time to go over the technical report you released.
Been using this to enhance my DND games.. its transformed it! allocating voices to actual NPC parts is awesome.. highly recommended and much fairer priced then Elevenlabs - highly recommended!!!
Fish Audio
https://x.com/i/trending/2031460658311737490
big fish audio fans for a long time, been witness the team always go above and beyond. let's gooooo s2! congrats on this launch
Fish Audio
@kellyann3644 Thank you Kelly for the long time support. We appreciate you so much <3
Flow GPT
Good job!
Fish Audio
@lifan_wang Thanks for your support Lifan! Hope you have fun trying it out, let us know your thoughts!
Adjust Page Brightness - Smart Control
this is called gold mate! keep making more such products like these
Fish Audio
@kshitij_mishra4 thanks man!!
HakkoAI
exactly what we need, gonna try it now
Fish Audio