Elevenlabs Transcribe
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
Official ElevenLabs skill for speech-to-text transcription.
Convert audio to text with state-of-the-art accuracy. Supports 90+ languages, speaker diarization, and realtime streaming.
brew install ffmpeg on macOS){baseDir}/scripts/transcribe.sh <audio_file> [options] {baseDir}/scripts/transcribe.sh --url <stream_url> [options] {baseDir}/scripts/transcribe.sh --mic [options]
Transcribe a local audio file:
{baseDir}/scripts/transcribe.sh recording.mp3
With speaker identification:
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize
Get full JSON response with timestamps:
{baseDir}/scripts/transcribe.sh interview.wav --diarize --json
Stream from a URL (e.g., live radio, podcast):
{baseDir}/scripts/transcribe.sh --url https://npr-ice.streamguys1.com/live.mp3
Transcribe from microphone:
{baseDir}/scripts/transcribe.sh --mic
Stream a local file in realtime (useful for testing):
{baseDir}/scripts/transcribe.sh audio.mp3 --realtime
Suppress status messages on stderr:
{baseDir}/scripts/transcribe.sh --mic --quiet
| Option | Description |
|---|---|
| Identify different speakers in the audio |
| ISO language hint (e.g., , , , ) |
| Output full JSON with timestamps and metadata |
| Tag audio events (laughter, music, applause) |
| Stream local file instead of batch processing |
| Show interim transcripts during realtime mode |
| Suppress status messages (recommended for agents) |
Plain text transcription:
The quick brown fox jumps over the lazy dog.
--json){ "text": "The quick brown fox jumps over the lazy dog.", "language_code": "eng", "language_probability": 0.98, "words": [ {"text": "The", "start": 0.0, "end": 0.15, "type": "word", "speaker_id": "speaker_0"} ] }
Final transcripts print as they're committed. With
--partials:
[partial] The quick [partial] The quick brown fox The quick brown fox jumps over the lazy dog.
Audio: MP3, WAV, M4A, FLAC, OGG, WebM, AAC, AIFF, Opus Video: MP4, AVI, MKV, MOV, WMV, FLV, WebM, MPEG, 3GPP
Limits: Up to 3GB file size, 10 hours duration
The script exits with non-zero status on errors:
ELEVENLABS_API_KEY environment variable| Scenario | Command |
|---|---|
| Transcribe a recording | |
| Meeting with multiple speakers | |
| Live radio/podcast stream | |
| Voice input from user | |
| Need word timestamps | |
No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.