🗣️ Text-to-speech using GLM-TTS for generating audio
Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output w...
Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output w...
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
GLM-TTS brings high-quality text-to-speech generation into your agent workflow using the GLM-TTS service. It converts text into natural-sounding audio with three built-in voices, adjustable speed and volume, and support for custom voice cloning — enabling agents to produce spoken content for accessibility, presentations, podcasts, or any audio output need.
The skill uses the
uvx zai-tts CLI tool to interface with the GLM-TTS service. Users provide text content (directly or from a file), select a voice profile, and optionally adjust speed and volume parameters. The service processes the text and returns a WAV audio file saved to the local filesystem. Authentication requires two credentials — a user ID and token — extracted from the audio.z.ai browser interface.
Install the
uv package manager via Homebrew or pip to access the uvx binary. Configure two environment variables — ZAI_AUDIO_USERID and ZAI_AUDIO_TOKEN — obtained from the browser developer console at audio.z.ai. Then run uvx zai-tts --text "Your text here" --voice lila to generate your first audio file.MIT-0 (Free to use, modify, and redistribute. No a
No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.