sylvanding/omelette

Omelette Banner

A full-stack Scientific Literature Lifecycle Management System

CI License Python 3.12 Node.js 22+ FastAPI React 18 Docs

中文 · Documentation · Quick Start · Report Bug


Omelette automates the full research literature pipeline: from keyword management and multi-source search, through deduplication and PDF crawling, to OCR processing, a RAG-powered knowledge base, and AI writing assistance. V2 adds a chat-centric UX, multi-provider LLM support, LangGraph pipeline orchestration, and MCP integration for AI IDE clients.

Om (Omni-) + Lit (Literature) = Omlit ≈ Omelette 🍳

✨ Features

- 🔑 **Keyword Management**: three-level hierarchy with LLM-powered expansion and search formula generation for WOS, Scopus, and PubMed.
- 🔍 **Multi-Source Search**: federated search across Semantic Scholar, OpenAlex, arXiv, and Crossref with standardized metadata.
- 🧹 **Smart Deduplication**: three-stage pipeline: DOI hard dedup → title similarity → LLM verification.
- 📑 **Incremental Subscription**: RSS feeds and API-based scheduled updates to track new publications automatically.
- 💬 **Chat Playground**: ChatGPT-style conversational interface for RAG queries and writing assistance.
- 🔌 **Multi-LLM Support**: LangChain integration for OpenAI, Anthropic, Aliyun, Volcengine, and Ollama providers.
- 📥 **PDF Crawler**: multi-channel download via Unpaywall, arXiv, and direct-URL fallback strategies.
- 📝 **OCR Processing**: native text extraction via MinerU (auto-managed subprocess) or PaddleOCR GPU fallback.
- 🧠 **RAG Knowledge Base**: LlamaIndex engine with ChromaDB, GPU-aware embeddings, hybrid retrieval, and cited answers.
- ✍️ **Writing Assistant**: summarization, citation generation (GB/T 7714, APA, MLA), review outlines, and gap analysis.
- 🔄 **LangGraph Pipeline**: pipeline orchestration with HITL interrupt/resume and persistent checkpointing.
- ⚡ **GPU Resource Management**: TTL-based auto-unload for GPU models, MinerU subprocess auto-management, a monitoring API, and an exit-cleanup watchdog.
- 🔗 **MCP Integration**: Model Context Protocol server for AI IDE clients (Cursor, Claude Code, etc.).
- 🌐 **i18n**: bilingual UI (zh/en) built with shadcn/ui and Radix primitives.
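To make the three-stage deduplication above concrete, here is a minimal, hypothetical sketch (not Omelette's actual implementation): exact-DOI dedup first, then title similarity via `difflib`, with borderline pairs deferred to an optional LLM verifier callback. The field names (`doi`, `title`) and thresholds are illustrative assumptions.

```python
from difflib import SequenceMatcher

def dedup(papers, title_threshold=0.92, llm_verify=None):
    """Three-stage dedup sketch: DOI hard dedup -> title similarity -> LLM verify."""
    seen_dois, kept = set(), []
    for paper in papers:
        doi = (paper.get("doi") or "").lower()
        if doi and doi in seen_dois:
            continue  # stage 1: exact DOI duplicate
        is_dup = False
        for existing in kept:
            ratio = SequenceMatcher(
                None, paper["title"].lower(), existing["title"].lower()
            ).ratio()
            if ratio >= title_threshold:
                is_dup = True  # stage 2: near-identical title
            elif ratio >= 0.80 and llm_verify is not None:
                is_dup = llm_verify(paper, existing)  # stage 3: borderline, ask an LLM
            if is_dup:
                break
        if not is_dup:
            if doi:
                seen_dois.add(doi)
            kept.append(paper)
    return kept
```

The ordering matters: the cheap DOI check short-circuits before any pairwise title comparison, and the expensive LLM call only fires in the ambiguous similarity band.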

πŸ—οΈ Architecture

```
Keywords ──→ Search ──→ Dedup ──→ Crawler ──→ OCR ──→ RAG ──→ Writing
    │          │          │          │         │       │         │
    ▼          ▼          ▼          ▼         ▼       ▼         ▼
[LangChain] [Sources]  [SQLite]   [PDFs]  [Paddle] [LlamaIndex] [LLM]
    │                                                  │
    └─────────────────── LangGraph ────────────────────┘
    │
    └── MCP (Model Context Protocol) ──→ AI IDE clients
```
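The stage ordering in the diagram can be read as a simple composition of state-transforming steps. The following is a hypothetical plain-Python sketch of that idea only; the real project uses LangGraph, and the stage stubs here are invented for illustration:

```python
from functools import reduce

# Hypothetical stage stubs; each takes and returns a project "state" dict.
def search(state):
    return {**state, "papers": ["p1", "p2", "p2"]}

def dedup(state):
    return {**state, "papers": sorted(set(state["papers"]))}

def crawl(state):
    return {**state, "pdfs": [f"{p}.pdf" for p in state["papers"]]}

PIPELINE = [search, dedup, crawl]

def run(state):
    # Feed each stage's output into the next, in order.
    return reduce(lambda s, stage: stage(s), PIPELINE, state)
```

LangGraph adds what this sketch lacks: checkpointing between stages and human-in-the-loop interrupt/resume.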
| Layer | Technology |
| --- | --- |
| Backend | FastAPI, SQLAlchemy 2 (async), Pydantic v2, Python 3.12 |
| Frontend | React 18, Vite, TypeScript, TailwindCSS v4, shadcn/ui, Radix, TanStack Query |
| Database | SQLite + aiosqlite, Alembic migrations |
| Vector Store | ChromaDB |
| RAG | LlamaIndex with GPU-aware embeddings |
| LLM | LangChain (OpenAI, Anthropic, Aliyun, Volcengine, Ollama) |
| Orchestration | LangGraph with HITL interrupt/resume |
| OCR | MinerU (auto-managed) + pdfplumber (native) + PaddleOCR (scanned) |
| MCP | Model Context Protocol server |
| Docs | VitePress (bilingual EN/ZH) |

🚀 Quick Start

Prerequisites

  • Conda or Miniconda
  • Node.js 22+
  • (Optional) CUDA for GPU-accelerated OCR and embeddings
  • (Optional) API keys: OpenAI, Anthropic, Aliyun Bailian, or Volcengine for LLM; Semantic Scholar for higher rate limits

1. Clone & setup

```bash
git clone git@github.com:sylvanding/omelette.git
cd omelette

# Create the conda env and install all backend dependencies
conda env create -f environment.yml
conda activate omelette
```

2. Configure

```bash
cp .env.example .env
# Edit .env with your API keys and data paths
```
Key environment variables

| Variable | Description |
| --- | --- |
| `DATABASE_URL` | SQLite path (default: `sqlite:///./data/omelette.db`) |
| `DATA_DIR` | Base path for PDFs, OCR output, and ChromaDB |
| `LLM_PROVIDER` | `openai`, `anthropic`, `aliyun`, `volcengine`, `ollama`, or `mock` |
| `OPENAI_API_KEY` | OpenAI API key |
| `ANTHROPIC_API_KEY` | Anthropic API key |
| `ALIYUN_API_KEY` | Aliyun Bailian API key |
| `VOLCENGINE_API_KEY` | Volcengine Doubao API key |
| `SEMANTIC_SCHOLAR_API_KEY` | Optional; raises the Semantic Scholar rate limit |
| `GPU_MODE` | GPU preset: `conservative`, `balanced` (default), or `aggressive` |
| `MODEL_TTL_SECONDS` | Auto-unload GPU models after N idle seconds (default: 300) |
| `MINERU_AUTO_MANAGE` | Auto start/stop the MinerU subprocess (default: `true`) |
| `PDF_PARSER` | `auto`, `mineru`, or `pdfplumber` |

See .env.example for the full list.
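As an illustrative sketch only (every value below is a placeholder; `.env.example` remains the authoritative template), a minimal `.env` might look like:

```ini
# Illustrative values only; see .env.example for the full list
DATABASE_URL=sqlite:///./data/omelette.db
DATA_DIR=./data
LLM_PROVIDER=openai
OPENAI_API_KEY=sk-your-key-here
GPU_MODE=balanced
MODEL_TTL_SECONDS=300
MINERU_AUTO_MANAGE=true
PDF_PARSER=auto
```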

3. Start backend

```bash
cd backend

# Run database migrations
alembic upgrade head

# Start the server
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
```

On startup, the backend automatically:

  • Writes a PID file to `DATA_DIR/omelette.pid`
  • Starts a GPU model TTL monitor (auto-unloads idle models)
  • Manages the MinerU subprocess lifecycle (if `MINERU_AUTO_MANAGE=true`)
  • Registers cleanup handlers (atexit + SIGHUP) so GPU resources are released even if the process exits unexpectedly
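The TTL-based auto-unload can be pictured roughly like this. This is a hypothetical sketch, not the project's code: each model records its last-use time, and a periodic sweep unloads anything idle longer than the TTL (`MODEL_TTL_SECONDS` in the real configuration).

```python
import time

class ModelTTLRegistry:
    """Track last-use timestamps and unload models idle past a TTL (sketch)."""

    def __init__(self, ttl_seconds: float, unload_fn):
        self.ttl = ttl_seconds
        self.unload_fn = unload_fn  # callback that actually frees GPU memory
        self.last_used: dict[str, float] = {}

    def touch(self, name: str) -> None:
        """Record that a model was just used."""
        self.last_used[name] = time.monotonic()

    def sweep(self) -> list[str]:
        """Unload every model idle longer than the TTL; return their names."""
        now = time.monotonic()
        expired = [n for n, t in self.last_used.items() if now - t > self.ttl]
        for name in expired:
            self.unload_fn(name)
            del self.last_used[name]
        return expired
```

In the real backend a monitor task would call something like `sweep()` on an interval; any inference request would `touch()` its model first, resetting the idle clock.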

4. (Optional) GPU watchdog

For extra safety against kill -9 or crashes, run the external watchdog:

```bash
python backend/scripts/gpu_watchdog.py --daemon
```

The watchdog monitors the Omelette process and cleans up GPU resources if it terminates abnormally.

5. Start frontend

```bash
cd frontend
npm install
npm run dev
```

Open http://localhost:3000 in your browser.

6. (Optional) MinerU setup

If using MinerU for PDF parsing (PDF_PARSER=mineru):

```bash
# Create a separate conda env for MinerU
conda create -n mineru python=3.10
conda activate mineru
pip install "magic-pdf[full]"
```

Set `MINERU_CONDA_ENV=mineru` in `.env`. Omelette will auto-start MinerU when needed.

Troubleshooting: if you get `ModuleNotFoundError: No module named 'fastapi'`, make sure the conda environment is activated: `conda activate omelette`.

📂 Project Layout

```
omelette/
├── backend/              # FastAPI application
│   ├── app/
│   │   ├── api/v1/       # REST endpoints
│   │   ├── models/       # SQLAlchemy ORM models
│   │   ├── schemas/      # Pydantic request/response schemas
│   │   ├── services/     # Business logic
│   │   ├── pipelines/    # LangGraph pipeline definitions
│   │   ├── config.py     # Settings from .env
│   │   ├── database.py   # Async engine and session
│   │   └── main.py       # App entry, lifespan, CORS
│   ├── mcp_server.py     # MCP (Model Context Protocol) server
│   ├── alembic/          # Database migrations
│   ├── scripts/          # Utilities (gpu_watchdog.py)
│   ├── tests/            # pytest-asyncio tests (526 tests)
│   └── pyproject.toml    # Python dependencies
├── frontend/             # React SPA
│   └── src/
│       ├── pages/        # Dashboard, ProjectDetail, Chat, modules
│       ├── components/   # Layout, shared UI
│       │   └── ui/       # shadcn/ui components
│       ├── services/     # Typed API client
│       ├── hooks/        # Custom hooks (useToastMutation, etc.)
│       ├── stores/       # Zustand state
│       ├── i18n/         # Internationalization (zh/en)
│       ├── test/         # Vitest setup, MSW mocks, fixtures
│       └── lib/          # Axios client, utils
├── e2e/                  # Playwright E2E tests
├── docs/                 # VitePress documentation (EN/ZH)
├── assets/               # Banner, logo, mascot images
├── environment.yml       # Conda env (Python 3.12)
├── Makefile              # Dev workflow shortcuts
├── .env.example          # Configuration template
├── playwright.config.ts  # Playwright E2E configuration
└── .github/workflows/    # CI (ruff, pytest, vitest, tsc, build, docs)
```

🛠️ Development

```bash
make pre-commit-install   # Install pre-commit hooks
make lint                 # Run linters
make format               # Auto-format code
make test                 # Run all tests
make dev                  # Start both backend and frontend
```

Running Tests

```bash
# Backend (526 tests)
cd backend && pytest tests/ -v

# Frontend unit tests (28 tests: Vitest + Testing Library + MSW)
cd frontend && npm test

# Frontend type check and build
cd frontend && npx tsc --noEmit && npm run build

# E2E tests (optional; requires a running frontend dev server)
npx playwright test
```

📑 API Overview

REST APIs under /api/v1/:

| Endpoint | Description |
| --- | --- |
| `GET/POST /projects` | Project CRUD |
| `GET/POST /projects/{id}/papers` | Paper management |
| `GET/POST /projects/{id}/keywords` | Keyword management |
| `GET /projects/{id}/keywords/search-formula` | Generate search formula |
| `POST /projects/{id}/search` | Execute multi-source search |
| `POST /projects/{id}/dedup/run` | Run deduplication |
| `POST /projects/{id}/crawl/start` | Start PDF download |
| `POST /projects/{id}/ocr/process` | Run OCR on papers |
| `POST /projects/{id}/rag/index` | Build vector index |
| `POST /projects/{id}/rag/query` | RAG retrieval |
| `POST /projects/{id}/writing/assist` | Writing assistance |
| `POST /projects/{id}/writing/review-draft/stream` | Streaming literature review (SSE) |
| `POST /chat` | Chat messages (playground) |
| `POST /chat/complete` | Smart autocomplete suggestions |
| `GET /projects/{id}/papers/{paper_id}/citation-graph` | Citation graph (Semantic Scholar) |
| `GET/POST /conversations` | Conversation CRUD |
| `GET/POST /pipelines` | Pipeline management |
| `GET/POST /subscriptions` | Subscription management |
| `GET/POST /settings` | Settings and health |
| `GET /settings/health` | Health check |
| `GET /gpu/status` | GPU model and memory status |
| `POST /gpu/unload` | Manually unload GPU models |

MCP server: `/mcp` (WebSocket/SSE for AI IDE clients)

Full documentation: API Reference
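For orientation, the endpoint shapes above can be wrapped in a tiny client. This is a hypothetical stdlib-only sketch (the class name, `url` helper, and payload handling are invented here; the frontend's real typed client lives in `frontend/src/services/`):

```python
import json
import urllib.request

BASE = "http://localhost:8000/api/v1"

class OmeletteClient:
    """Minimal URL-building sketch for the REST endpoints listed above."""

    def __init__(self, base: str = BASE):
        self.base = base.rstrip("/")

    def url(self, path: str, **params) -> str:
        # Fill templated segments like {id} from keyword arguments.
        return self.base + path.format(**params)

    def post(self, path: str, payload: dict, **params):
        req = urllib.request.Request(
            self.url(path, **params),
            data=json.dumps(payload).encode(),
            headers={"Content-Type": "application/json"},
        )
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)
```

With a running backend, something like `OmeletteClient().post("/projects/{id}/rag/query", {"question": "..."}, id=1)` would hit the RAG retrieval endpoint; the exact request schema is documented in the API Reference.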

🤝 Contributing

See CONTRIBUTING.md for guidelines.

📄 License

MIT License. Copyright © 2026 Sylvan Ding.
