E
ElevenLabs Scribe
World's most accurate speech-to-text model - 96.7% accuracy in 99 languages
Features
Real-time Transcription
Yes
Speaker Identification
Yes
AI Summaries
No
Action Items
No
Core Features
- Speech-to-text transcription (Scribe v1)
- Real-time transcription (Scribe v2, 150ms latency)
- Word-level timestamps
- Speaker diarization (up to 32 speakers)
- Audio event tagging
- Support for 99 languages
- File upload (audio/video up to 3GB, 10 hours)
- API access
- Voice generation (v3)
- Voice cloning
Languages
99
Pricing
Pay-as-you-go
$0.40 per hour
Transcription only, scales down with volume
Enterprise
Custom (6,000+ hours/month)
Custom MSAs, DPAs, reduced pricing at scale
Pricing model: Usage_based
Integrations
User Reviews
No user reviews yet. Be the first to write a review!