Fish Audio

Expressive Text-to-Speech and Voice Cloning

Rating: 4.5 (6 reviews)

891 followers

Fish Audio is the most expressive and emotionally rich text-to-speech model. It generates lifelike voices that capture emotion, rhythm, and nuance with remarkable realism. Fish Audio Voice Clone recreates a natural voice from just 10 seconds of audio, preserving accent, tone, and speaking habits. Proudly built by the open-source team behind So-VITS-SVC and Bert-VITS2, it aims to give a soul to every voice.

Fish Audio launches

Fish Audio S2: Real Expressive AI Voices

Launched on March 10th, 2026

Fish Audio S1: Expressive Voice Cloning and Text-to-Speech

Launched on October 20th, 2025

Fish Speech 1.4: Open-Source Multilingual Text-to-Speech with Voice Cloning

Launched on September 11th, 2024

Fish Speech: Few-shot voice cloning and text-to-speech

Launched on July 18th, 2024