Meshy User Interview Bot

AI-powered interview bot that conducts user research for Meshy. Features pre-recorded video avatars with real-time speech synthesis and intelligent conversation.

Features

Video Avatar — Pre-recorded video clips with double-buffered playback for smooth transitions
Voice Synthesis — Google Cloud TTS (Chirp 3 HD) with voice consistency lock and Edge TTS fallback
Speech Recognition — Google Cloud STT with brand-name correction, long-audio segmentation, and audio backup retry
AI Interviewer — Gemini 2.5 Flash powers adaptive conversation with V2 interviewing constraints
Real-time WebSocket — Low-latency audio streaming and avatar control
Multi-language — 10 languages supported (en, zh, de, fr, ja, ko, es, pt, ru, it)
Feishu Integration — Auto-saves transcripts and synthesis reports to Feishu Wiki
Domain Vocabulary — 190+ phrase hints for Meshy features, 3D modeling, gaming, and printing terms

Tech Stack

Layer	Technology
LLM	Gemini 2.5 Flash
TTS	Google Cloud TTS (Chirp 3 HD) / Edge TTS fallback
STT	Google Cloud Speech-to-Text (latest_long enhanced model)
Avatar	Pre-recorded video clips (double-buffered)
Backend	FastAPI + WebSocket
Frontend	Vanilla JS + ES Modules
Reports	Feishu Wiki API

Quick Start

Prerequisites

Python 3.12+
uv (recommended) or pip

1. Clone & Install

git clone https://github.com/taichi-dev/user-interview-bot.git
cd user-interview-bot
uv sync

2. Configure Environment

cp .env.example .env

Edit .env and fill in your API keys:

GEMINI_API_KEY=your-gemini-api-key
GEMINI_MODEL=gemini-2.5-flash
GOOGLE_CLOUD_API_KEY=your-google-cloud-api-key
FEISHU_APP_ID=your-feishu-app-id
FEISHU_APP_SECRET=your-feishu-app-secret

3. Run

cd backend
python main.py

Open http://localhost:8000 in your browser.

4. Use

Enter your email
Select language
Choose an avatar
Start the interview — the bot will guide you through questions

Project Structure

user-interview-bot/
├── backend/
│   ├── main.py              # FastAPI server + WebSocket
│   ├── conversation.py      # Gemini-powered conversation engine
│   ├── tts_service.py       # TTS (Chirp 3 HD + voice lock + Edge TTS)
│   ├── stt_service.py       # STT (Google Cloud + phrase hints + corrections)
│   ├── feishu_service.py    # Feishu Wiki integration for reports
│   └── config.py            # Settings & environment
├── frontend/
│   ├── index.html           # Main page
│   ├── js/
│   │   ├── app.js           # App logic & WebSocket client
│   │   ├── avatar.js        # Video avatar management
│   │   ├── speech.js        # STT WebSocket client + audio backup retry
│   │   ├── websocket.js     # Main WebSocket with keepalive
│   │   └── pcm-processor.js # AudioWorklet for mic capture
│   ├── css/                 # Styles
│   └── assets/videos/       # Pre-recorded avatar video clips
├── .env.example
├── pyproject.toml
└── README.md

License

Internal use — Meshy, Inc.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
backend		backend
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
railway.json		railway.json
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Meshy User Interview Bot

Features

Tech Stack

Quick Start

Prerequisites

1. Clone & Install

2. Configure Environment

3. Run

4. Use

Project Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Meshy User Interview Bot

Features

Tech Stack

Quick Start

Prerequisites

1. Clone & Install

2. Configure Environment

3. Run

4. Use

Project Structure

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages