Phone Voice Agent
Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to
Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
Runs a local FastAPI server that acts as a real-time voice bridge.
Twilio (Phone) <--> WebSocket (Audio) <--> [Local Server] <--> Deepgram (STT) | +--> OpenAI (LLM) +--> ElevenLabs (TTS)
Install Dependencies:
pip install -r scripts/requirements.txt
Set Environment Variables (in
~/.moltbot/.env, ~/.clawdbot/.env, or export):
export DEEPGRAM_API_KEY="your_key" export OPENAI_API_KEY="your_key" export ELEVENLABS_API_KEY="your_key" export TWILIO_ACCOUNT_SID="your_sid" export TWILIO_AUTH_TOKEN="your_token" export PORT=8080
Start the Server:
python3 scripts/server.py
Expose to Internet:
ngrok http 8080
Configure Twilio:
https://<your-ngrok-url>.ngrok.io/incomingPOSTCall your Twilio number. The agent should answer, transcribe your speech, think, and reply in a natural voice.
SYSTEM_PROMPT in scripts/server.py to change the persona.ELEVENLABS_VOICE_ID to use different voices.gpt-4o-mini to gpt-4 for smarter (but slower) responses.No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.