Pocket TTS Skill

Fully local, offline text-to-speech using Kyutai's Pocket TTS model. Generate high-quality audio from text without any API calls or internet connection. Features 8 built-in voices, voice cloning support, and runs entirely on CPU.

Features

🎯 Fully local - No API calls, runs completely offline
🚀 CPU-only - No GPU required, works on any computer
⚡ Fast generation - ~2-6x real-time on CPU
🎤 8 built-in voices - alba, marius, javert, jean, fantine, cosette, eponine, azelma
🎭 Voice cloning - Clone any voice from a WAV sample
🔊 Low latency - ~200ms first audio chunk
📚 Simple Python API - Easy integration into any project
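The "~2-6x real-time" figure above can be read as seconds of audio produced per second of wall-clock time. The following is a minimal sketch of how such a factor could be computed for your own machine; the helper names `real_time_factor` and `timed` are hypothetical and not part of Pocket TTS, and the numbers in the example are illustrative rather than measured.

```python
import time


def real_time_factor(num_samples: int, sample_rate: int, elapsed_seconds: float) -> float:
    """Return seconds of audio produced per second of wall-clock time."""
    if sample_rate <= 0 or elapsed_seconds <= 0:
        raise ValueError("sample_rate and elapsed_seconds must be positive")
    audio_seconds = num_samples / sample_rate
    return audio_seconds / elapsed_seconds


def timed(fn, *args, **kwargs):
    """Run fn and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


# Illustrative numbers only (not a benchmark): 72,000 samples at 24,000 Hz
# is 3 s of audio. If generating it took 1 s, the speed is 3x real time.
print(real_time_factor(72_000, 24_000, 1.0))
```

To measure a real run, wrap the generation call with `timed(...)` and pass the length of the returned audio together with the model's sample rate.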

Installation

# 1. Accept the model license on Hugging Face
# https://huggingface.co/kyutai/pocket-tts

# 2. Install the package
pip install pocket-tts

# Or use uv for automatic dependency management
uvx pocket-tts generate "Hello world"

Usage

CLI

# Basic usage
pocket-tts "Hello, I am your AI assistant"

# With specific voice
pocket-tts "Hello" --voice alba --output hello.wav

# With custom voice file (voice cloning)
pocket-tts "Hello" --voice-file myvoice.wav --output output.wav

# Adjust speed
pocket-tts "Hello" --speed 1.2

# Start local server
pocket-tts --serve

# List available voices
pocket-tts --list-voices

Python API

from pocket_tts import TTSModel
import scipy.io.wavfile

# Load model
tts_model = TTSModel.load_model()

# Get voice state
voice_state = tts_model.get_state_for_audio_prompt(
    "hf://kyutai/tts-voices/alba-mackenna/casual.wav"
)

# Generate audio
audio = tts_model.generate_audio(voice_state, "Hello world!")

# Save to WAV
scipy.io.wavfile.write("output.wav", tts_model.sample_rate, audio.numpy())

# Check sample rate
print(f"Sample rate: {tts_model.sample_rate} Hz")
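If SciPy is not available, the save step above could also be done with Python's standard `wave` module. The sketch below is one possible approach, not the skill's own method. It assumes the generated audio is mono with float samples in the range [-1.0, 1.0], and `write_wav_int16` is a hypothetical helper name.

```python
import io
import struct
import wave


def write_wav_int16(target, sample_rate, samples):
    """Write mono float samples (assumed to lie in [-1.0, 1.0]) as 16-bit PCM.

    `target` may be a file path or a writable, seekable binary file object.
    Returns the number of samples written.
    """
    frames = bytearray()
    for sample in samples:
        clipped = max(-1.0, min(1.0, float(sample)))  # guard against overshoot
        frames += struct.pack("<h", int(round(clipped * 32767)))
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)            # mono (assumption)
        wav.setsampwidth(2)            # 2 bytes = 16 bits per sample
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(frames))
    return len(frames) // 2


# Round trip through an in-memory buffer with made-up sample values.
buffer = io.BytesIO()
written = write_wav_int16(buffer, 24_000, [0.0, 0.5, -0.5, 1.0])
buffer.seek(0)
with wave.open(buffer, "rb") as wav:
    print(written, wav.getframerate(), wav.getnframes())
```

With the objects from the example above, the call would look something like `write_wav_int16("output.wav", tts_model.sample_rate, audio.numpy().tolist())`, provided the amplitude assumption holds for your output.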

Available Voices

Voice	Description
alba	Casual female voice
marius	Male voice
javert	Clear male voice
jean	Natural male voice
fantine	Female voice
cosette	Female voice
eponine	Female voice
azelma	Female voice

Or use `--voice-file /path/to/wav.wav` for custom voice cloning.

Options

Option	Description	Default
`text`	Text to convert	Required
`-o, --output`	Output WAV file	`output.wav`
`-v, --voice`	Voice preset	`alba`
`-s, --speed`	Speech speed (0.5-2.0)	`1.0`
`--voice-file`	Custom WAV for cloning	None
`--serve`	Start HTTP server	False
`--list-voices`	List all voices	False
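When calling the CLI from a script, the options in the table can be assembled and checked before the process is launched. The following is a minimal sketch using only the flags, defaults, and voice names listed above. `build_command` is a hypothetical helper, and letting `--voice-file` take precedence over `--voice` is an assumption, not documented behavior.

```python
BUILT_IN_VOICES = (
    "alba", "marius", "javert", "jean",
    "fantine", "cosette", "eponine", "azelma",
)


def build_command(text, output="output.wav", voice="alba", speed=1.0, voice_file=None):
    """Build an argument list for the pocket-tts CLI from the documented options."""
    if not text:
        raise ValueError("text is required")
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be between 0.5 and 2.0")

    command = ["pocket-tts", text, "--output", output, "--speed", str(speed)]
    if voice_file is not None:
        # Assumption: a custom WAV replaces the preset voice.
        command += ["--voice-file", voice_file]
    else:
        if voice not in BUILT_IN_VOICES:
            raise ValueError(f"unknown voice: {voice!r}")
        command += ["--voice", voice]
    return command


print(build_command("Hello", voice="marius", output="hello.wav", speed=1.2))
```

The resulting list can be passed to `subprocess.run(command, check=True)`. Passing a list rather than a shell string avoids quoting problems when the text contains spaces or quotes.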
