Fish Audio is the most expressive and emotionally rich text-to-speech model. It generates lifelike voices that capture emotion, rhythm, and nuance with remarkable realism. Fish Audio Voice Clone recreates a natural voice from just 10 seconds of audio—preserving accent, tone, and speaking habits. Proudly built by the open-source team behind So-VITS-SVC and Bert-VITS2, giving a soul to every voice.
This is the 4th launch from Fish Audio. View more

Fish Audio S2
Launched this week
We've open-sourced Fish Audio S2, a new generation of expressive TTS that lets you direct voices with natural language. Add cues like [whisper] or [laughing nervously], generate multi-speaker dialogue in one pass, and create scary-real voices across 80+ languages.






Free
Launch Team / Built With








This is a top tier quality, it's my new favorite 🤯. And it's only a 4.5B model too 😯 incredible!
Great blog post too. I can't wait to get some time to go over the technical report you released.
Fish Audio
https://x.com/i/trending/2031460658311737490
big fish audio fans for a long time, been witness the team always go above and beyond. let's gooooo s2! congrats on this launch
Fish Audio
@kellyann3644 Thank you Kelly for the long time support. We appreciate you so much <3
Flow GPT
Good job!
Fish Audio
@lifan_wang Thanks for your support Lifan! Hope you have fun trying it out, let us know your thoughts!
Adjust Page Brightness - Smart Control
this is called gold mate! keep making more such products like these
Fish Audio
@kshitij_mishra4 thanks man!!
HakkoAI
exactly what we need, gonna try it now
Fish Audio
As someone who used to lead a team that created dozens of voice overs for different market, these tools are a game-changer.