Voice
Convert text to speech using Microsoft Edge's TTS engine with customizable voices, direct playback, and automatic temporary file cleanup.
Convert text to speech using Microsoft Edge's TTS engine with customizable voices, direct playback, and automatic temporary file cleanup.
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
The Voice skill provides enhanced text-to-speech functionality using edge-tts, allowing you to convert text to spoken audio with multiple playback options.
Before using this skill, you need to install the required dependency:
pip3 install edge-tts
Or use the skill's install action:
await skill.execute({ action: 'install' });
Speak text directly without storing to file:
const result = await skill.execute({ action: 'speak', // New improved action text: 'Hello, how are you today?' }); // Audio is played directly and temporary file is cleaned up automatically
Convert text to speech with default settings:
const result = await skill.execute({ action: 'tts', text: 'Hello, how are you today?' }); // Returns a MEDIA link to the audio file
With direct playback:
const result = await skill.execute({ action: 'tts', text: 'Hello, how are you today?', playImmediately: true // Plays the audio immediately after generation });
With custom options:
const result = await skill.execute({ action: 'tts', text: 'This is a sample of voice customization.', options: { voice: 'zh-CN-XiaoxiaoNeural', rate: '+10%', volume: '-5%', pitch: '+10Hz' } });
Play an existing audio file:
const result = await skill.execute({ action: 'play', filePath: '/path/to/audio/file.mp3' });
Get a list of available voices:
const result = await skill.execute({ action: 'voices' });
Clean up temporary audio files older than 1 hour (default):
const result = await skill.execute({ action: 'cleanup' });
Or specify a custom age threshold:
const result = await skill.execute({ action: 'cleanup', options: { hoursOld: 2 // Clean files older than 2 hours } });
The following options are available for text-to-speech:
voice: The voice to use (default: 'zh-CN-XiaoxiaoNeural')rate: Speech rate adjustment (default: '+0%')volume: Volume adjustment (default: '+0%')pitch: Pitch adjustment (default: '+0Hz')Edge-TTS supports many voices in different languages:
temp directorypip3 install edge-tts)No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.