A comprehensive medical chatbot application built with Flask, Gemini AI, Pinecone, and PostgreSQL. Features real-time chat with document-based RAG (Retrieval-Augmented Generation), source citations, conversation history, and advanced query processing.
- Docker Desktop installed
- Pinecone API Key (from the Pinecone console)
- Google Gemini API Key (from Google AI Studio)
1. Create a `.env` file in the project root:

   ```
   PINECONE_API_KEY=your_pinecone_key_here
   GOOGLE_API_KEY=your_gemini_key_here
   SECRET_KEY=change-this-to-random-string-in-production
   GEMINI_MODEL=gemini-2.5-flash
   ```

2. Start the application:

   ```
   docker-compose up --build
   ```

3. Open your browser at http://localhost:8080

4. Register an account and start chatting!
For detailed Docker setup and running instructions, see DOCKER_SETUP.md.
- ✅ User Authentication - Secure registration and login system
- ✅ Multi-Document Upload - Upload and manage multiple PDF documents
- ✅ Real-time Chat - Streaming responses with immediate feedback
- ✅ Source Citations - Transparent source attribution with clickable citations
- ✅ Conversation History - Persistent chat history per user
- ✅ User Feedback - Thumbs up/down feedback system
- ✅ Advanced RAG - Query rewriting and multi-hop reasoning
- ✅ Document Management - View, manage, and delete uploaded documents
- Backend: Flask (Python 3.10)
- Database: PostgreSQL 15 (Docker container)
- Vector Store: Pinecone (384-dimensional embeddings)
- LLM: Google Gemini (configurable model)
- Embeddings: HuggingFace sentence-transformers (all-MiniLM-L6-v2)
- Frontend: HTML/CSS/JavaScript with Bootstrap
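To illustrate how the pieces above fit together: retrieval compares 384-dimensional embedding vectors by similarity. A minimal pure-Python sketch of the cosine-similarity ranking that a vector store like Pinecone performs (toy 3-dimensional vectors and chunk names stand in for the real embeddings):

```python
import math

def cosine_similarity(a, b):
    # Dot product over the product of magnitudes; Pinecone's "cosine" metric.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec, chunk_vecs, k=2):
    # Rank stored chunk vectors against the query vector, best first.
    scored = sorted(chunk_vecs.items(),
                    key=lambda kv: cosine_similarity(query_vec, kv[1]),
                    reverse=True)
    return [name for name, _ in scored[:k]]

# Toy stand-ins for the real 384-dimensional all-MiniLM-L6-v2 embeddings.
chunks = {
    "diabetes_p3": [0.9, 0.1, 0.0],
    "asthma_p7":   [0.0, 0.8, 0.2],
    "diet_p1":     [0.7, 0.3, 0.1],
}
print(top_k([1.0, 0.0, 0.0], chunks))  # chunks most aligned with the query
```

In the real app, the query and every document chunk are embedded by the same model, so "aligned vectors" corresponds to semantically similar text.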
```
medical-chatbot/
├── app.py                     # Main Flask application
├── docker-compose.yml         # Docker Compose configuration
├── Dockerfile                 # Application container definition
├── requirements.txt           # Python dependencies
├── .env                       # Environment variables (create this)
├── src/                       # Source code
│   ├── database.py            # Database models (User, Document, Conversation, etc.)
│   ├── auth.py                # Authentication routes
│   ├── helper.py              # Helper functions (PDF loading, text splitting)
│   ├── prompt.py              # System prompts
│   └── rag_advanced.py        # Advanced RAG features (query rewriting, multi-hop)
├── tests/                     # Test suite
│   ├── conftest.py            # Pytest fixtures
│   ├── test_auth.py           # Authentication tests
│   ├── test_chat_api.py       # Chat API tests
│   ├── test_database.py       # Database model tests
│   ├── test_documents_api.py  # Document management tests
│   ├── test_feedback_api.py   # Feedback system tests
│   ├── test_integration.py    # Integration tests
│   └── test_rag_advanced.py   # Advanced RAG tests
├── templates/                 # HTML templates
│   ├── base.html              # Base template with navbar
│   ├── chat.html              # Chat interface
│   ├── documents.html         # Document management page
│   ├── login.html             # Login page
│   └── register.html          # Registration page
├── static/                    # CSS/JS files
│   └── style.css              # Custom styles
└── data/
    └── uploads/               # Uploaded PDFs (persisted via Docker volume)
```
Create a `.env` file in the project root with:

```
# Required
PINECONE_API_KEY=your_pinecone_api_key
GOOGLE_API_KEY=your_gemini_api_key
SECRET_KEY=your-random-secret-key-change-in-production

# Optional (with defaults)
GEMINI_MODEL=gemini-2.5-flash  # Default: gemini-2.5-flash
DATABASE_URL=postgresql://medicalbot:medicalbot_password@db:5432/medical_chatbot  # Auto-set by docker-compose
```

For detailed Docker setup instructions, see DOCKER_SETUP.md.
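As a quick illustration of the `.env` format these variables follow, here is a toy parser for `KEY=VALUE` lines with `#` comments. The real app presumably relies on python-dotenv or docker-compose's built-in `.env` handling; this sketch only shows the format (and does not support `#` inside values):

```python
def parse_env(text):
    # Parse KEY=VALUE lines, skipping blanks and '#' comments.
    env = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # strip trailing comments
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        env[key.strip()] = value.strip()
    return env

sample = """
# Required
PINECONE_API_KEY=your_pinecone_api_key
GEMINI_MODEL=gemini-2.5-flash  # Default
"""
print(parse_env(sample))
```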
For comprehensive testing instructions, see TESTING.md.
Quick start:

```
# Run all tests
docker-compose exec app pytest

# Run with coverage
docker-compose exec app pytest --cov=src --cov=app
```

- `GET /auth/login` - Login page
- `POST /auth/login` - Log in
- `GET /auth/register` - Registration page
- `POST /auth/register` - Register a new user
- `GET /auth/logout` - Log out
- `GET /chat` - Chat interface
- `POST /api/chat/stream` - Stream chat response (Server-Sent Events)
  - Body:

    ```json
    { "message": "...", "conversation_id": 123, "use_advanced_rag": false }
    ```
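Since the streaming endpoint emits Server-Sent Events, the client reassembles the response from `data:` lines. A minimal sketch of that parsing step (the JSON payload shape shown here is a hypothetical example; the app's actual event fields may differ):

```python
import json

def parse_sse(stream_text):
    # Collect the JSON payload of each "data:" line in an SSE stream.
    events = []
    for line in stream_text.splitlines():
        if line.startswith("data:"):
            events.append(json.loads(line[len("data:"):].strip()))
    return events

# Hypothetical stream: token deltas followed by a done marker.
sample = (
    'data: {"token": "Hel"}\n\n'
    'data: {"token": "lo"}\n\n'
    'data: {"done": true}\n\n'
)
tokens = [e["token"] for e in parse_sse(sample) if "token" in e]
print("".join(tokens))  # prints "Hello"
```

In the browser, the chat page would typically do the equivalent with `EventSource` or a streamed `fetch`.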
- `GET /documents` - Document management page
- `POST /api/upload` - Upload a new PDF document (multipart/form-data)
- `DELETE /api/documents/<id>` - Delete a document
- `GET /api/conversations` - Get the user's conversations
- `GET /api/conversations/<id>/messages` - Get messages for a conversation
- `POST /api/feedback` - Submit feedback for a message
  - Body:

    ```json
    { "message_id": 123, "rating": "positive|negative", "comment": "..." }
    ```
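A sketch of validating that feedback payload server-side, using the field names from the body above (the actual validation in app.py may differ; this is illustrative only):

```python
def validate_feedback(payload):
    # "message_id" must be an integer, "rating" positive/negative;
    # "comment" is optional free text.
    if not isinstance(payload.get("message_id"), int):
        return False, "message_id must be an integer"
    if payload.get("rating") not in ("positive", "negative"):
        return False, "rating must be 'positive' or 'negative'"
    return True, None

print(validate_feedback({"message_id": 123, "rating": "positive"}))
print(validate_feedback({"message_id": 123, "rating": "meh"}))
```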
- User - User accounts with authentication
- Document - Uploaded PDF documents metadata
- DocumentChunk - Chunk metadata for citations
- Conversation - Chat conversations
- Message - Individual messages in conversations
- Citation - Source citations for messages
- Feedback - User feedback on messages
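The relationships between these models can be sketched with plain dataclasses. The real code in src/database.py presumably uses an ORM such as SQLAlchemy; the field names below are illustrative assumptions, not the actual schema:

```python
from dataclasses import dataclass, field

@dataclass
class Message:
    id: int
    conversation_id: int   # each message belongs to one Conversation
    role: str              # "user" or "assistant"
    content: str
    citations: list = field(default_factory=list)  # Citation entries for this message

@dataclass
class Conversation:
    id: int
    user_id: int           # each conversation belongs to one User
    messages: list = field(default_factory=list)

conv = Conversation(id=1, user_id=42)
conv.messages.append(Message(id=10, conversation_id=1, role="user",
                             content="What are common asthma triggers?"))
print(len(conv.messages))  # prints 1
```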
Automatically improves user queries for better document retrieval. Enabled by default in standard RAG mode.
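A toy sketch of the idea behind query rewriting. The real implementation in src/rag_advanced.py presumably asks Gemini to rewrite the query; this hard-coded abbreviation map just shows the effect, making the query's wording match the documents so the embedding retrieves better chunks:

```python
# Toy abbreviation map standing in for an LLM rewrite step.
EXPANSIONS = {
    "bp": "blood pressure",
    "t2d": "type 2 diabetes",
    "mi": "myocardial infarction",
}

def rewrite_query(query):
    # Expand known abbreviations so the query wording matches the documents.
    words = [EXPANSIONS.get(w.lower(), w) for w in query.split()]
    return " ".join(words)

print(rewrite_query("normal bp range"))  # prints "normal blood pressure range"
```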
Breaks down complex questions into sub-questions and retrieves information iteratively. Enable via the "Advanced RAG" toggle in the chat interface.
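The multi-hop loop can be sketched as: decompose the question into sub-questions, retrieve for each one, and accumulate context. The decomposition and the keyword retrieval below are toy stand-ins; the real code presumably has Gemini generate the sub-questions and uses Pinecone for retrieval:

```python
def retrieve(sub_question, corpus):
    # Toy keyword retrieval standing in for a Pinecone similarity search.
    return [doc for doc in corpus
            if any(w in doc.lower() for w in sub_question.lower().split())]

def multi_hop(sub_questions, corpus):
    # Accumulate context iteratively, one retrieval pass per sub-question.
    context = []
    for sq in sub_questions:
        for doc in retrieve(sq, corpus):
            if doc not in context:
                context.append(doc)
    return context

corpus = [
    "Metformin is a first-line drug for type 2 diabetes.",
    "Common metformin side effects include nausea.",
    "Aspirin thins the blood.",
]
# "What are the side effects of the first-line diabetes drug?" decomposed:
hops = ["first-line drug for diabetes", "metformin side effects"]
print(multi_hop(hops, corpus))
```

Note that the second hop depends on what the first hop found ("metformin"), which is why a single retrieval pass would miss the answer.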
Every response includes citations to source documents with:
- Document name
- Page number
- Content preview
- Clickable badges for easy navigation
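The four items above can be assembled into a small citation object for the frontend to render; a sketch with assumed field names (the real structure produced by the app may differ):

```python
def build_citation(doc_name, page, content, preview_chars=80):
    # Truncate chunk content to a short preview for the citation badge.
    preview = content[:preview_chars]
    if len(content) > preview_chars:
        preview += "..."
    return {"document": doc_name, "page": page, "preview": preview}

chunk_text = "Beta blockers reduce heart rate and blood pressure. " * 4
c = build_citation("cardiology_handbook.pdf", 12, chunk_text)
print(c["preview"])
```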
If you see `404 models/gemini-pro is not found`:

1. Check available models:

   ```
   docker-compose exec app python -c "
   from google import generativeai as genai
   import os
   genai.configure(api_key=os.environ.get('GOOGLE_API_KEY'))
   for model in genai.list_models():
       if 'generateContent' in model.supported_generation_methods:
           print(f'{model.name}')
   "
   ```

2. Update the `.env` file with a model from the list:

   ```
   GEMINI_MODEL=gemini-2.5-flash  # or gemini-2.5-pro, gemini-pro-latest, etc.
   ```

3. Restart the app:

   ```
   docker-compose restart app
   ```
- Check that the upload folder exists and is writable
- Verify file size is under 16MB
- Check browser console for errors
- Ensure you're logged in (authentication required)
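The checks above can be sketched as a small pre-upload validation function. The 16 MB limit comes from the checklist; the PDF-extension check is an assumption based on the app accepting PDF uploads:

```python
MAX_UPLOAD_BYTES = 16 * 1024 * 1024  # 16MB limit from the checklist above

def validate_upload(filename, size_bytes):
    # Reject non-PDFs and oversized files before sending them to the server.
    if not filename.lower().endswith(".pdf"):
        return False, "only PDF files are accepted"
    if size_bytes > MAX_UPLOAD_BYTES:
        return False, "file exceeds the 16MB limit"
    return True, None

print(validate_upload("guidelines.pdf", 2 * 1024 * 1024))
print(validate_upload("notes.docx", 1024))
```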
For more troubleshooting tips, see DOCKER_SETUP.md.
Run tests with:

```
docker-compose exec app pytest
```

Note: The tests directory is mounted, so you can edit tests and run them immediately without restarting. Only restart the container if tests are hanging or using cached files.
For more details, see TESTING.md.
- Fork the repository
- Create a feature branch
- Make your changes
- Run tests:

  ```
  docker-compose exec app pytest
  ```

- Submit a pull request
For issues and questions, please open an issue on GitHub.
See the LICENSE file.
Note: This application uses Google Gemini AI for generating responses. Make sure you have a valid API key and that billing is enabled on your Google Cloud project if required for your chosen model.
Documentation:
- DOCKER_SETUP.md - Detailed Docker setup and running instructions
- TESTING.md - Comprehensive testing guide