Pocket TTS
Generate high-quality English speech offline on CPU using 8 built-in voices or custom voice cloning with Kyutai's Pocket TTS model.
Fully local, offline text-to-speech using Kyutai's Pocket TTS model. Generate high-quality audio from text without any API calls or internet connection. Features 8 built-in voices, voice cloning support, and runs entirely on CPU.
1. Accept the model license on Hugging Face: https://huggingface.co/kyutai/pocket-tts
2. Install the package
pip install pocket-tts
Or use uv for automatic dependency management
uvx pocket-tts generate "Hello world"
Basic usage
pocket-tts "Hello, I am your AI assistant"
With specific voice
pocket-tts "Hello" --voice alba --output hello.wav
With custom voice file (voice cloning)
pocket-tts "Hello" --voice-file myvoice.wav --output output.wav
Adjust speed
pocket-tts "Hello" --speed 1.2
Start local server
pocket-tts --serve
List available voices
pocket-tts --list-voices
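The commands above can also be driven from a script. The following is a minimal sketch, not part of Pocket TTS itself. It assumes the `pocket-tts` executable is on the PATH and accepts the text as a positional argument together with `--voice` and `--output`, exactly as in the examples above. The helper names `build_commands` and `run_all` and the `line_001.wav` naming scheme are hypothetical.

```python
import subprocess
from typing import List


def build_commands(lines: List[str], voice: str = "alba", prefix: str = "line") -> List[List[str]]:
    """Build one pocket-tts invocation per non-empty line of text."""
    commands = []
    index = 0
    for line in lines:
        text = line.strip()
        if not text:
            continue  # skip blank lines so output numbering stays contiguous
        index += 1
        output = f"{prefix}_{index:03d}.wav"
        # Flags mirror the CLI examples above; nothing else is assumed.
        commands.append(["pocket-tts", text, "--voice", voice, "--output", output])
    return commands


def run_all(commands: List[List[str]]) -> None:
    """Run each command in order; raises CalledProcessError if one fails."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Print the commands instead of running them, as a dry run.
    for cmd in build_commands(["Hello.", "", "How are you?"]):
        print(" ".join(cmd))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the text contains quotes or other shell metacharacters.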
from pocket_tts import TTSModel
import scipy.io.wavfile
Load model
tts_model = TTSModel.load_model()
Get voice state
voice_state = tts_model.get_state_for_audio_prompt("hf://kyutai/tts-voices/alba-mackenna/casual.wav")
Generate audio
audio = tts_model.generate_audio(voice_state, "Hello world!")
Save to WAV
scipy.io.wavfile.write("output.wav", tts_model.sample_rate, audio.numpy())
Check sample rate
print(f"Sample rate: {tts_model.sample_rate} Hz")
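If SciPy is not available, the save step can be approximated with Python's standard `wave` module. This is a sketch under stated assumptions, not the library's own method: it assumes the generated audio can be converted to a mono sequence of floats in the range [-1.0, 1.0], which the text above does not state. The function name `write_wav_pcm16` is hypothetical, and the sine tone and the 24000 Hz rate are placeholders for real model output and `tts_model.sample_rate`.

```python
import math
import struct
import wave
from typing import Sequence


def write_wav_pcm16(path: str, samples: Sequence[float], sample_rate: int) -> None:
    """Write mono float samples (assumed in [-1.0, 1.0]) as a 16-bit PCM WAV file."""
    frames = bytearray()
    for s in samples:
        clipped = max(-1.0, min(1.0, float(s)))  # clip out-of-range values
        frames += struct.pack("<h", int(round(clipped * 32767)))
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)       # mono
        wav.setsampwidth(2)       # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(frames))


if __name__ == "__main__":
    rate = 24000  # placeholder value; use tts_model.sample_rate in practice
    # One second of a 440 Hz tone stands in for generated speech.
    tone = [0.3 * math.sin(2 * math.pi * 440 * n / rate) for n in range(rate)]
    write_wav_pcm16("tone.wav", tone, rate)
```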
| Voice | Description |
|---|---|
| alba | Casual female voice |
| marius | Male voice |
| javert | Clear male voice |
| jean | Natural male voice |
| fantine | Female voice |
| cosette | Female voice |
| eponine | Female voice |
| azelma | Female voice |
Or use `--voice-file /path/to/wav.wav` for custom voice cloning.
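A script that accepts either a built-in voice name or a custom recording could choose between the two flags as sketched below. This is an illustrative helper, not part of Pocket TTS: the function name `voice_args` is hypothetical, and treating any value ending in `.wav` as a cloning sample is an assumption made for the example.

```python
from typing import List

# The eight built-in voices listed in the table above.
BUILT_IN_VOICES = (
    "alba", "marius", "javert", "jean",
    "fantine", "cosette", "eponine", "azelma",
)


def voice_args(voice: str) -> List[str]:
    """Return CLI arguments for a built-in voice name or a custom WAV path."""
    cleaned = voice.strip()
    if cleaned.lower() in BUILT_IN_VOICES:
        return ["--voice", cleaned.lower()]
    if cleaned.lower().endswith(".wav"):
        # Assumption: anything ending in .wav is a voice-cloning sample.
        return ["--voice-file", cleaned]
    raise ValueError(f"unknown voice {voice!r}; expected one of {BUILT_IN_VOICES} or a .wav path")


if __name__ == "__main__":
    print(voice_args("alba"))
    print(voice_args("/path/to/wav.wav"))
```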
| Option | Description | Default |
|---|---|---|
| text | Text to convert | Required |
| `--output` | Output WAV file | |
| `--voice` | Voice preset | |
| `--speed` | Speech speed (0.5-2.0) | |
| `--voice-file` | Custom WAV for cloning | None |
| `--serve` | Start HTTP server | False |
| `--list-voices` | List all voices | False |
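To make the option set concrete, here is a hypothetical `argparse` wrapper that mirrors it, using the flag names from the command examples and the descriptions and defaults from the table. It is a sketch and not the tool's actual parser. The program name `pocket-tts-wrapper` and the function names are invented for illustration, and defaults the table leaves blank are represented as `None` rather than guessed. The only validation added is the 0.5-2.0 speed range stated in the table.

```python
import argparse


def speed_value(raw: str) -> float:
    """Parse a speed and enforce the documented 0.5-2.0 range."""
    value = float(raw)
    if not 0.5 <= value <= 2.0:
        raise argparse.ArgumentTypeError("speed must be between 0.5 and 2.0")
    return value


def make_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="pocket-tts-wrapper")
    # Text is optional here only so that --serve and --list-voices work without it.
    parser.add_argument("text", nargs="?", help="Text to convert")
    parser.add_argument("--output", default=None, help="Output WAV file")
    parser.add_argument("--voice", default=None, help="Voice preset")
    parser.add_argument("--speed", type=speed_value, default=None, help="Speech speed (0.5-2.0)")
    parser.add_argument("--voice-file", default=None, help="Custom WAV for cloning")
    parser.add_argument("--serve", action="store_true", help="Start HTTP server")
    parser.add_argument("--list-voices", action="store_true", help="List all voices")
    return parser


if __name__ == "__main__":
    args = make_parser().parse_args(["Hello", "--voice", "alba", "--speed", "1.2"])
    print(args)
```

Validating the speed in a `type=` callable lets `argparse` report an out-of-range value as a normal usage error before any audio is generated.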