Telephony Bot Example

An AI-powered phone call handler that transcribes caller speech, generates an LLM response, and returns synthesized audio — using Twilio for telephony, Deepgram for speech-to-text, and OpenAI TTS for speech synthesis.

Features

✅ Twilio webhook integration for inbound phone calls
✅ Real-time speech transcription via Deepgram
✅ LLM response generation with Ollama
✅ High-quality speech synthesis with OpenAI TTS
✅ Returns transcript, text answer, and audio file path

Prerequisites

You need:

A Twilio account with a phone number
A Deepgram API key (for transcription)
An OpenAI API key (for TTS)

Setup

1. Configure API Keys

Edit workflow.yaml and components/tts/component.yaml with your keys:

# workflow.yaml — Deepgram key for transcription
transcriber:
  online:
    apiKey: "dg-your-key-here"

# components/tts/component.yaml — OpenAI key for TTS
tts:
  online:
    apiKey: "sk-your-key-here"

2. Run the Server

# From examples/telephony-bot directory
kdeps run workflow.yaml --dev

# Or from project root
kdeps run examples/telephony-bot/workflow.yaml --dev

The server listens on http://0.0.0.0:16395.

3. Expose to the Internet

Twilio needs a publicly accessible URL. Use a tunnel for local development:

# ngrok (free tier)
ngrok http 16395

# cloudflared
cloudflared tunnel --url http://localhost:16395

4. Configure Twilio Webhook

In your Twilio Console:

Go to Phone Numbers → Active Numbers
Select your phone number
Under Voice & Fax → A Call Comes In, set:
- Webhook: https://your-tunnel-url.ngrok.io/api/v1/call
- HTTP Method: POST

How It Works

Call Flow

Caller → Twilio → Webhook POST /api/v1/call
                           ↓
                   Deepgram Transcription
                           ↓
                     LLM (llama3.2:1b)
                           ↓
                    OpenAI TTS (alloy)
                           ↓
              Response: transcript + answer + audio path

When a call comes in, Twilio sends the caller's audio to the workflow's webhook endpoint. KDeps:

Transcribes the audio using Deepgram
Sends the transcript to the LLM
Converts the LLM response to speech using OpenAI TTS
Returns a JSON response with the transcript, text answer, and audio file path

Response

{
  "success": true,
  "data": {
    "transcript": "What is the weather like today?",
    "answer": "I don't have access to real-time weather data, but I can help with other questions!",
    "audio": "/tmp/kdeps-tts/response-abc123.mp3"
  }
}

Structure

telephony-bot/
├── workflow.yaml              # Telephony source, Twilio + Deepgram config
├── components/
│   └── tts/
│       └── component.yaml     # .komponent: online TTS via OpenAI (alloy voice)
└── resources/
    ├── llm-response.yaml      # LLM chat (takes inputTranscript as prompt)
    └── call-response.yaml     # API response with transcript + answer + audio

The tts component encapsulates the run.tts executor. Swap TTS providers by replacing the component without touching the rest of the workflow:

# Build a packaged version for sharing
kdeps package components/tts --output components/

# Or install a community component
kdeps component install tts

Key Expressions

Expression	Description
`inputTranscript`	Caller's speech transcribed to text
`get('llmResponse')`	LLM-generated answer
`ttsOutput`	Path to synthesized audio file

Customization

Use a Different STT Provider

# workflow.yaml
transcriber:
  online:
    provider: assemblyai        # openai-whisper | google-stt | aws-transcribe | deepgram | assemblyai
    apiKey: "YOUR_KEY"

Use ElevenLabs for More Natural TTS

# components/tts/component.yaml
tts:
  mode: online
  voice: "21m00Tcm4TlvDq8ikWAM"   # ElevenLabs voice ID
  online:
    provider: elevenlabs
    apiKey: "xi-your-key"

Use a Stronger LLM

# workflow.yaml
agentSettings:
  models:
    - llama3.1:8b

# resources/llm-response.yaml
chat:
  model: llama3.1:8b

Add a System Prompt for a Custom Persona

# resources/llm-response.yaml
chat:
  scenario:
    - role: assistant
      prompt: |
        You are Aria, a customer support agent for Acme Corp.
        You help callers with order status, returns, and product questions.
        Always ask for the caller's order number before looking up information.

Use Local SIP Device Instead of Twilio

# workflow.yaml
telephony:
  type: local
  device: /dev/ttyUSB0          # USB modem or ATA adapter serial device

telephony-bot

Install

README