Clonev
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) C
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) C
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
DO NOT try to use Docker containers directly. DO NOT try to interact with
coqui-xtts container - it is broken and restarting.
DO NOT try to use APIs or servers.
ONLY USE THE SCRIPT:
scripts/clonev.sh
The script handles everything automatically. Just call it with text, voice sample, and language.
Clones any voice from a short audio sample and generates new speech in that voice.
Input:
Output: OGG voice file (cloned voice speaking the text)
Works with: Any voice! Yours, a celebrity, a character, etc.
$(scripts/clonev.sh "Your text here" /path/to/voice_sample.wav language)
That's it! Nothing else needed.
en)VOICE_FILE=$(scripts/clonev.sh "TEXT_HERE" "/path/to/sample.wav" LANGUAGE)
The variable
$VOICE_FILE now contains the path to the generated OGG file.
# Generate cloned voice VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Hello, this is my cloned voice!" "/mnt/c/TEMP/Recording 25.wav" en)Send to Telegram (as voice message)
message action=send channel=telegram asVoice=true filePath="$VOICE"
# Generate Czech voice VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Ahoj, tohle je můj hlas" "/mnt/c/TEMP/Recording 25.wav" cs)Send
message action=send channel=telegram asVoice=true filePath="$VOICE"
#!/bin/bashGenerate voice
VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Task completed!" "/path/to/sample.wav" en)
Verify file was created
if [ -f "$VOICE" ]; then echo "Success! Voice file: $VOICE" ls -lh "$VOICE" else echo "Error: Voice file not created" fi
| Code | Language | Example Usage |
|---|---|---|
| English | |
| Czech | |
| German | |
| French | |
| Spanish | |
Full list: en, cs, de, fr, es, it, pl, pt, tr, ru, nl, ar, zh, ja, hu, ko
Good samples:
Bad samples:
/mnt/c/TEMP/Docker-containers/coqui-tts/models-xtts/Make sure you're in the skill directory or use full path:
/home/bernie/clawd/skills/clonev/scripts/clonev.sh "text" sample.wav en
/)ls -la /path/to/sample.wavThe model should auto-download. If not:
cd /mnt/c/TEMP/Docker-containers/coqui-tts docker run --rm --entrypoint "" \ -v $(pwd)/models-xtts:/root/.local/share/tts \ ghcr.io/coqui-ai/tts:latest \ python3 -c "from TTS.api import TTS; TTS('tts_models/multilingual/multi-dataset/xtts_v2')"
USER: "Clone my voice and say 'hello'" → Get: sample path, text="hello", language="en" → Run: VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "hello" "/path/to/sample.wav" en) → Result: $VOICE contains path to OGG file → Send: message action=send channel=telegram asVoice=true filePath="$VOICE"
USER: "Make me speak Czech" → Get: sample path, text="Ahoj", language="cs" → Run: VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Ahoj" "/path/to/sample.wav" cs) → Send: message action=send channel=telegram asVoice=true filePath="$VOICE"
Generated files are saved to:
/mnt/c/TEMP/Docker-containers/coqui-tts/output/clonev_output.ogg
The script returns this path, so you can use it directly.
scripts/clonev.shcoqui-xtts containerSimple. Just use the script.
Clone any voice. Speak any language. Just use the script.
No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.