ElevenLabs Scribe

World's most accurate speech-to-text model - 96.7% accuracy in 99 languages

Excellent Swedish Support

Features

Real-time Transcription: Yes
Speaker Identification: Yes
AI Summaries: No
Action Items: No

Core Features

  • Speech-to-text transcription (Scribe v1)
  • Real-time transcription (Scribe v2, 150ms latency)
  • Word-level timestamps
  • Speaker diarization (up to 32 speakers)
  • Audio event tagging
  • Support for 99 languages
  • File upload (audio/video up to 3GB, 10 hours)
  • API access
  • Voice generation (v3)
  • Voice cloning
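
As a rough illustration of the upload limits listed above (3GB, 10 hours), a pre-upload check might look like the sketch below. The function name is hypothetical, "3GB" is assumed to mean 3 GiB, and a file exactly at a limit is assumed to be accepted; the listing does not specify either point.

```python
from typing import List

# Limits taken from the feature list; the byte interpretation is an assumption.
MAX_UPLOAD_BYTES = 3 * 1024**3      # "3GB", read here as 3 GiB
MAX_DURATION_SECONDS = 10 * 3600    # "10 hours"


def upload_problems(size_bytes: int, duration_seconds: float) -> List[str]:
    """Return a list of reasons a file would exceed the listed limits.

    An empty list means the file fits within both limits.
    """
    problems = []
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 3GB")
    if duration_seconds > MAX_DURATION_SECONDS:
        problems.append("recording is longer than 10 hours")
    return problems
```

Running the check locally before uploading avoids sending a large file only to have it rejected.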

Languages

99

Pricing

Free Tier Available

Free tier requires attribution, no commercial licensing

Pay-as-you-go

$0.40 per hour

Transcription only, scales down with volume
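
To make the pay-as-you-go rate concrete, the sketch below estimates cost at a flat $0.40 per hour. The function name is hypothetical, and because the volume discounts are not itemized here, the result should be read as an upper bound rather than an exact bill.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed pay-as-you-go rate; volume discounts are not modeled (assumption).
RATE_PER_HOUR = Decimal("0.40")


def estimate_cost(audio_seconds: int) -> Decimal:
    """Estimate transcription cost in USD, rounded to the nearest cent."""
    hours = Decimal(audio_seconds) / Decimal(3600)
    cost = hours * RATE_PER_HOUR
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# A 90-minute recording is 1.5 hours, so 1.5 * 0.40 = 0.60 USD.
ninety_minutes = estimate_cost(90 * 60)
```

Using `Decimal` instead of floats keeps the cent rounding exact.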

Enterprise

Custom (6,000+ hours/month)

Custom MSAs, DPAs, reduced pricing at scale

Pricing model: Usage-based

Integrations

API-based (custom integrations possible)
