Chubby♨️

kimmonismus

@sylv_1_ retweeted Mistral AI released Voxtral TTS, a 3-billion-parameter text-to-speech model with open weights that the company says outperformed ElevenLabs Flash v2.5 in human preference tests roughly 63% of the time on standard voices and nearly 70% on voice customization.The model runs on about 3 GB of RAM, achieves 90-millisecond time-to-first-audio, supports nine languages, and can clone a voice from just five seconds of reference audio, including cross-lingual adaptation that preserves the speaker's accent. Posted Mar 26, 2026 at 12:49PM