scribe-v1

by elevenlabs

Scribe-v1 is a cutting-edge speech recognition model from ElevenLabs, designed for accurate speech-to-text transcription in 99 languages. It excels at handling real-world audio and consistently outperforms models such as Gemini 2.0 Flash and Whisper Large V3, achieving notably low word error rates even in underserved languages.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Transcription (1 minute)

$0.0025

Capabilities

Input Modalities

Audio

Output Modalities

Text

Rate Limits

Requests per minute (RPM) and per day (RPD) by tier. More about tiers here

TierRPMRPD
Free
Tier 110
Tier 215
Tier 325
Tier 450

Usage Analytics

Token usage across the last 30 active days