ElevenLabs
Browse models from ElevenLabs
Eleven Multilingual v2 (Free)
Eleven-Multilingual-v2 is ElevenLabs’ most advanced multilingual text-to-speech model, delivering high-quality voice synthesis across a wide range of languages with improved realism and expressiveness. It is optimized for both accuracy and naturalness in multilingual scenarios.
Eleven v3
Eleven-v3 is ElevenLabs’ most expressive text-to-speech model, supporting over 70 languages, multi-speaker dialogue, and inline audio tags such as [excited], [whispers], [laughs], and [sighs]. The tags give fine-grained control over delivery, which makes the model well suited to dynamic, context-aware conversations.
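As a rough illustration of the audio tags mentioned above, the sketch below assembles a short script in which each line starts with a bracketed tag. It only builds the input text and makes no API call. The helper names `tagged_line` and `build_script` are hypothetical, and placing the tag at the start of each line is an assumption made for illustration, not a documented requirement.

```python
def tagged_line(tag: str, text: str) -> str:
    """Prefix one line of text with an audio tag in square brackets."""
    return f"[{tag}] {text}"


def build_script(lines: list[tuple[str, str]]) -> str:
    """Join (tag, text) pairs into a single newline-separated script."""
    return "\n".join(tagged_line(tag, text) for tag, text in lines)


# Tags taken from the model description; the sentences are made up.
script = build_script([
    ("excited", "We finally shipped the release!"),
    ("whispers", "Nobody has noticed the typo yet."),
    ("sighs", "I will fix it in the morning."),
])
print(script)
```

The resulting string is what would be passed as the text input to a text-to-speech request. How the tags are rendered depends on the model.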
Scribe v1
Scribe-v1 is a speech recognition model from ElevenLabs, designed for accurate speech-to-text transcription in 99 languages. It handles real-world audio well and consistently outperforms models such as Gemini 2.0 Flash and Whisper Large V3, achieving notably low word error rates even in underserved languages.
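For readers unfamiliar with the metric, word error rate (WER) is the number of word-level substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the number of reference words. The sketch below is a minimal, generic implementation using edit distance over whitespace-separated words. It is not the evaluation code behind the comparison above, and the example sentences are invented.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"{wer:.3f}")
```

Real evaluations usually normalize case and punctuation before scoring. That step is omitted here to keep the sketch short.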
Eleven Multilingual v2
Eleven-Multilingual-v2 is ElevenLabs’ most advanced multilingual text-to-speech model, delivering high-quality voice synthesis across a wide range of languages with improved realism and expressiveness. It is optimized for both accuracy and naturalness in multilingual scenarios.
Eleven Monolingual v1
Eleven-Monolingual-v1 is an English-only TTS model from ElevenLabs, providing clear, natural-sounding voice output for a variety of English-language applications.
Eleven Turbo v2
Eleven-Turbo-v2 is an English-optimized TTS model from ElevenLabs, designed for fast, high-quality speech synthesis with low latency. It is ideal for real-time applications and interactive voice systems.
Eleven Multilingual v1
Eleven-Multilingual-v1 is an earlier multilingual TTS model from ElevenLabs, offering robust support for multiple languages and reliable, natural-sounding voice generation.