scribe-v1
by elevenlabsScribe-v1 is a cutting-edge speech recognition model from ElevenLabs, designed for accurate speech-to-text transcription in 99 languages. It excels at handling real-world audio and consistently outperforms models such as Gemini 2.0 Flash and Whisper Large V3, achieving notably low word error rates even in underserved languages.
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Transcription (1 minute)
$0.0025
Capabilities
Input Modalities
Audio
Output Modalities
Text
Rate Limits
Requests per minute (RPM) and per day (RPD) by tier. More about tiers here
Tier | RPM | RPD |
---|---|---|
Free | — | — |
Tier 1 | 10 | — |
Tier 2 | 15 | — |
Tier 3 | 25 | — |
Tier 4 | 50 | — |
Usage Analytics
Token usage across the last 30 active days