A voice assistant application powered by Gemini and Twilio, capable of handling inbound and outbound calls with real-time AI interaction.
- Real-time Voice AI: Uses Gemini Live API for voice conversations.
- Twilio Integration: Handles phone calls via Twilio Voice.
- Call Summarization: automatically summarizes calls after they end.
- Docker (and Docker Compose)
- Twilio Account (with phone number used for voice assistant)
- Ngrok Account
- Gemini API Key
cp .env.example .env
Edit .env to provide API keys and settings.
To start the application:
sudo docker compose up --build
POST /call: Initiate an outbound call.
curl -X POST "http://0.0.0.0:8080/call" \
-H "Content-Type: application/json" \
-d '{
"to_number": "+1234567890",
"prompt": "Ask how life is going"
}'- After every call, a call summary is sent to the configured webhook.
{
"call": {
"call metadata here"
},
"summarized_text": "AI summarization of call"
}