DeepMind Blog·2 min read

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Gemini 3.1 Flash TTS enhances AI speech capabilities.

DeepMind has launched Gemini 3.1 Flash TTS, a text-to-speech model aimed at improving the quality and expressiveness of AI-generated speech. The model introduces granular audio tags that let users control vocal style and pacing in more than 70 languages. All generated audio is embedded with a SynthID watermark, a measure intended to combat misinformation.

Key Takeaways

  1. Gemini 3.1 Flash TTS supports over 70 languages with enhanced vocal control.

  2. The model achieved an Elo score of 1,211 on the Artificial Analysis TTS leaderboard.
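
To give a rough sense of what an Elo score means, the sketch below computes the expected preference rate between two rated models using the standard Elo formula with a 400-point scale. It is only an illustration: the summary does not say exactly how the leaderboard computes its ratings, and the 1,111 opponent rating is a hypothetical value chosen for the example, not a figure from the leaderboard.

```python
def elo_expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard Elo model.

    A value of 0.5 means the two are rated as evenly matched; values above
    0.5 mean A is expected to be preferred more often than B.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Rating reported in the summary above.
flash_tts_rating = 1211
# Hypothetical opponent rating, assumed purely for illustration.
other_model_rating = 1111

expected = elo_expected_score(flash_tts_rating, other_model_rating)
print(f"Expected preference rate: {expected:.2f}")  # roughly 0.64 for a 100-point gap
```

Under these assumptions, a 100-point lead corresponds to being preferred in roughly 64% of head-to-head comparisons, so the rating is best read relative to other entries rather than as an absolute quality measure.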
