
Text to Speech API
Ultra-realistic and low latency speech generation
Build with high-quality, controllable speech generation for real-time and bulk applications. Models optimized for latency, fidelity, and long-form consistency.
- Lovable
- Synthesia
- Stripe
- Perplexity
- Twilio
- Lovable
- Synthesia
- Stripe
- Perplexity
- Twilio
Built on the most powerful Voice AI models
Choose the right model for your use case: from ultra-low latency agents to expressive, long-form narration.

Flash v2.5
Our lowest latency speech synthesis model
- Ultra-low latency (~75ms)
- 32 languages supported
- 40,000 character limit
- ~$0.06 per minute

Turbo v2.5
Balanced quality and latency
- Low latency (~250-300ms)
- High quality voice generation
- 32 languages supported
- 40,000 character limit
- ~$0.06 per minute

Multilingual v2
Lifelike, consistent quality speech synthesis model
- Natural-sounding output
- 29 languages supported
- 10,000 character limit
- Designed for long-form generations
- ~$0.12 per minute

Eleven v3
Our most emotionally rich, expressive model
- Dramatic delivery and performance
- 70+ languages supported
- 3,000 character limit
- Multi-speaker dialogue
- ~$0.12 per minute
Everything you need to build production-ready speech
Generate expressive, controllable speech with models built for real-time, long-form, and production use.
Control emotion and delivery

Access 10,000+ voices

Voice design & cloning

Multi-speaker dialogue

Audio events and direction

Pronunciation dictionaries

Powering world’s leading companies and brands
“From dubbing Reels in local languages, to generating music and character voices in Horizon, ElevenLabs platform enables global creators, businesses, and enterprises to build with voice, music, and sound at scale.”
“Millions of people learn chess from creators like Hikaru, Levy, and Magnus every day on YouTube and Twitch. Now you can learn from them inside Chess.com in a way that feels immersive, personal, and full of character. Our mission is to build a chess coach that teaches at the right level, welcomes players of every skill level, and demystifies chess while keeping it fun and full of personality. With ElevenLabs and these amazing new voices, we’ve taken a big step toward making that vision a reality.”
“ElevenLabs made it easy for us to quickly bring powerful text-to-speech capabilities to our SDK, allowing Agents to respond in real time with expressive voices to user questions or as feedback to what it’s seeing.”

“Twilio has integrated ElevenLabs’ generative AI voice technology into its CPaaS, enhancing ConversationRelay. This integration allows businesses and developers to create conversational AI voice interactions that sound human, feel expressive, and respond in real time directly from the Twilio CPaaS platform. We at ElevenLabs are excited that Twilio has chosen ElevenLabs to enhance ConversationRelay with the most expressive, human sounding voices available. ”
APIs built for production

Frequently asked questions
Latest updates


Elevenlabs OSS Engineers Fund: supporting the open-source projects that shape our work
.webp&w=3840&q=80)

Introducing ElevenLabs UI: Open-source audio & agent components for the web
.webp&w=3840&q=80)
ElevenLabs Agents vs OpenAI Realtime API: Conversational Agents Showdown


.webp&w=3840&q=80)
Building Vibe Draw: combining ElevenLabs with FLUX Kontext for voice-powered image creation
.webp&w=3840&q=80)
How I built a text-to-commercial generator using ElevenLabs, Gemini, and VEO 2