Local lab for realtime voice bots and meeting-integration stacks.
Requires an evaluation pipeline for the use-case stacks.
- Pipecat: docs.pipecat.ai | pipecat.ai
- LiveKit: docs.livekit.io | livekit.io
- In `pipecat-ai==0.0.90`, some LiveKit flows may still process inbound video tracks even when video input is disabled. In camera-on sessions this can cause `agent-runner` memory growth and container OOM (ExitCode 137, `OOMKilled=true`).
- Typical symptom chain: bot drops after a few minutes, then bot-start/API calls fail (`fetch failed` / 502) because `agent-runner` exited.
- Latest validation on March 2, 2026 (`stacks/r3-livekit-meet-lab`): single-user manual call remained stable for 18 minutes, and manual bot restart worked; multi-user longevity still pending.
- Mitigation pattern for new stacks:
  - Keep bot transport video ingest disabled (`video_in_enabled=False`).
  - Add a guard so video subscriptions are ignored unless video processing is explicitly enabled.
  - Set the `agent-runner` restart policy in Compose (`restart: unless-stopped`) to reduce downtime after crashes while debugging.
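The subscription guard above can be sketched as a small helper. This is a minimal illustration, not Pipecat's or LiveKit's actual API; the function and parameter names are hypothetical:

```python
# Hypothetical guard: drop video subscriptions unless video processing
# is explicitly enabled. Names are illustrative, not Pipecat's API.

VIDEO_PROCESSING_ENABLED = False  # mirrors video_in_enabled=False


def should_handle_track(track_kind: str) -> bool:
    """Return True only for tracks the bot should actually process."""
    if track_kind == "video" and not VIDEO_PROCESSING_ENABLED:
        return False  # ignore camera tracks so frames never start buffering
    return True


def on_track_subscribed(track_kind: str, unsubscribe) -> None:
    """Event-handler sketch: unsubscribe immediately from unwanted video."""
    if not should_handle_track(track_kind):
        unsubscribe()  # release the track before memory can grow
```

The point of unsubscribing at subscription time, rather than discarding frames later, is that inbound video never enters the pipeline at all, which is what keeps `agent-runner` memory flat in camera-on sessions.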
UI entry point:
- User in browser (`web-client` or `meet`) talks to the bot.
- UI captures mic audio and plays bot audio.
Core loop events:
- Audio capture in browser.
- Relay to backend (`agent-runner`) via LiveKit/WebRTC or other transport.
- Transcribe or speech-to-speech ingest.
- LLM reasoning (plus optional tool calls).
- Audio synthesis (TTS or direct speech-to-speech output).
- Relay synthesized audio back to browser.
- Browser playback to user.
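The core loop above can be sketched end to end as one conversational turn. The stage functions here are stubs standing in for the real transport, STT, LLM, and TTS services, not the lab's actual code:

```python
# Sketch of one turn of the core loop. Each stage is a stub standing in
# for the real service (Deepgram/OpenAI/Cartesia or a speech-to-speech model).

def transcribe(audio: bytes) -> str:
    return audio.decode()  # stub: real stack does STT or S2S ingest


def reason(text: str) -> str:
    return f"echo: {text}"  # stub: real stack calls an LLM, optionally tools


def synthesize(text: str) -> bytes:
    return text.encode()  # stub: real stack does TTS or direct S2S output


def handle_turn(mic_audio: bytes) -> bytes:
    """Capture -> transcribe -> LLM -> synthesize -> relay back for playback."""
    transcript = transcribe(mic_audio)   # speech-to-text ingest
    reply_text = reason(transcript)      # LLM reasoning step
    return synthesize(reply_text)        # audio relayed back to the browser
```

In the real stacks each stage is asynchronous and streaming, so audio flows through the pipeline incrementally rather than turn-by-turn as in this sketch.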
- Containers and local orchestration: Docker Compose, Make
- Backend services: Python, FastAPI, Pipecat
- Frontend apps: Next.js, React
- Realtime media/control: LiveKit + WebRTC, Zoom join/leave adapter flow
- AI/speech APIs: OpenAI Realtime, Gemini Live, Deepgram, Cartesia
- Exploration workflow: Jupyter notebooks with `uv`
- `stacks/r0-dev-exploration`: baseline LiveKit + agent-runner + web-client
- `stacks/r1-eval-s2s-openai`: speech-to-speech eval stack (OpenAI)
- `stacks/r2-eval-s2s-gemini`: speech-to-speech eval stack (Gemini)
- `stacks/r3-livekit-meet-lab`: Meet + Desk admin stack
- `notebooks`: exploratory notebooks for TTS/S2S experiments
- Pick a stack in `stacks/` and open its README.
- Copy that stack's `.env` example files.
- Add required API keys.
- Run `make start` in the stack directory.
- Open the local URLs listed in that stack README.
- Stop with `make stop`.
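The steps above, as a shell session. The stack directory and the `.env.example` filename are assumptions; check the chosen stack's README for its actual env-file names:

```shell
# Example quickstart using the r3 stack; substitute any stack directory.
cd stacks/r3-livekit-meet-lab
cp .env.example .env   # assumption: the stack ships a .env.example
# edit .env to add the required API keys
make start             # bring the stack up (Docker Compose under the hood)
# ...open the local URLs listed in the stack README...
make stop              # tear the stack down
```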