ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

Join 0+ developers using this skill

skill

Creative & Media

beginner

0 installs

View Source

Last updated: April 23, 2026

AI Skill Market Insights

Real data. Real impact.

Popularity

Rising

Emerging

Active Users

Developers

Quick Start

Manual Installation

No automatic installation available. Please visit the source repository for installation instructions.

View Installation Instructions

TEAR & SHARE

3-5hrs/WK

RISING

0.0K+ USING

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs' Scribe v2 model. Supports 90+ languages with speaker diarization.

Quick Start

# Basic transcription
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3
With speaker diarization
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --diarize
Specify language (improves accuracy)
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --lang en
Full JSON output with timestamps
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --json

Options

Flag	Description
`--diarize`	Identify different speakers
`--lang CODE`	ISO language code (e.g., en, pt, es)
`--json`	Output full JSON with word timestamps
`--events`	Tag audio events (laughter, music, etc.)

Supported Formats

All major audio/video formats: mp3, m4a, wav, ogg, webm, mp4, etc.

API Key

Set

ELEVENLABS_API_KEY

environment variable, or configure in clawdbot.json:

{
  skills: {
    entries: {
      "elevenlabs-stt": {
        apiKey: "sk_..."
      }
    }
  }
}

Examples

# Transcribe a WhatsApp voice note
{baseDir}/scripts/transcribe.sh ~/Downloads/voice_note.ogg
Meeting recording with multiple speakers
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize --lang en
Get JSON for processing
{baseDir}/scripts/transcribe.sh podcast.mp3 --json > transcript.json

ElevenLabs Speech-to-Text

AI Skill Market Insights

Quick Start

Manual Installation

TEAR & SHARE

Tags

Chart MCP Server

Be Part of the 0+ Developer Community

ElevenLabs Speech-to-Text

Quick Start

With speaker diarization

Specify language (improves accuracy)

Full JSON output with timestamps

Options

Supported Formats

API Key

Examples

Meeting recording with multiple speakers

Get JSON for processing

Douyin MCP Server

KiCad MCP Server

Shadcn UI MCP Server

Drawio MCP Server

Channels

Learn

Compare

Company

Agents