SearchGPT 🔍

LLM-powered search engine with hybrid search and re-ranking capabilities.

Features

🔄 Hybrid Search: Combines FAISS vector search with Elasticsearch BM25 keyword search
🤖 LLM Re-ranking: Uses large language models to re-rank search results for better relevance
⚡ FastAPI Backend: High-performance REST API
📊 Evaluation Metrics: Built-in support for NDCG, MRR, and other IR metrics
🧪 Comprehensive Testing: Full test suite with pytest
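
For readers unfamiliar with the metrics named above, here is a minimal, self-contained sketch of NDCG@k and MRR. It is an illustration under stated assumptions, not the code in `src/evaluation/`: the function names are hypothetical, and the ideal ranking for NDCG is taken from the relevances of the retrieved list only.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the first k graded relevances."""
    # Rank 1 is discounted by log2(2) = 1, rank 2 by log2(3), and so on.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances: Sequence[float], k: int) -> float:
    """DCG divided by the DCG of the same relevances sorted best-first."""
    # Assumption: the ideal ordering is built from the retrieved list itself.
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


def mean_reciprocal_rank(ranked_hits: Sequence[Sequence[bool]]) -> float:
    """Mean of 1/rank of the first relevant result per query (0 if none)."""
    if not ranked_hits:
        return 0.0
    total = 0.0
    for hits in ranked_hits:
        for rank, is_relevant in enumerate(hits, start=1):
            if is_relevant:
                total += 1.0 / rank
                break
    return total / len(ranked_hits)
```

Each inner list passed to `mean_reciprocal_rank` holds one query's results in ranked order, with `True` marking a relevant document.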

Project Structure

```
SearchGPT/
├── src/
│   ├── api/              # FastAPI application
│   ├── hybrid_search/    # Hybrid search implementation
│   ├── llm_reranking/    # LLM re-ranking logic
│   ├── evaluation/       # Metrics and benchmarks
│   ├── core/             # Utilities (config, logging, cache)
│   └── deployment/       # Docker and deployment files
├── tests/                # Test suite
├── scripts/              # Utility scripts
├── data/                 # Data directory (indices, embeddings)
└── resources/            # Research papers and documentation
```

Getting Started

Prerequisites

Python 3.9+
UV (recommended) or pip

Installation

Clone the repository

```
git clone https://github.com/YourUsername/SearchGPT.git
cd SearchGPT
```

Install dependencies with UV
```
uv sync
```
Or with pip:
```
pip install -e .
```

Set up environment variables

```
cp .env.example .env
# Edit .env and add your API keys
```

Running the API

```
# With UV
uv run uvicorn src.api.main:app --reload

# Or without UV, binding the host and port explicitly
uvicorn src.api.main:app --reload --host 0.0.0.0 --port 8000
```

Visit http://localhost:8000/docs for the interactive API documentation.

Running Tests

```
# With UV
uv run pytest

# Or with pytest directly
pytest

# With an HTML coverage report
pytest --cov=src --cov-report=html
```

API Usage

Search Endpoint

```
curl -X POST "http://localhost:8000/api/v1/search" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "How does hybrid search work?",
    "top_k": 10,
    "use_reranking": true,
    "hybrid_alpha": 0.5
  }'
```

Response

```
{
  "query": "How does hybrid search work?",
  "results": [
    {
      "id": "doc1",
      "title": "Introduction to Hybrid Search",
      "content": "Hybrid search combines...",
      "score": 0.95,
      "metadata": {}
    }
  ],
  "total": 1,
  "processing_time_ms": 123.45
}
```
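
The same request can be sent from Python. The sketch below uses only the standard library and mirrors the fields shown in the curl example. The helper names `build_search_payload` and `search` are hypothetical, and the range check on `hybrid_alpha` is an assumption based on the 0.0 to 1.0 scale described under Configuration.

```python
import json
import urllib.request
from typing import Any

# Endpoint taken from the curl example; adjust the host and port as needed.
SEARCH_URL = "http://localhost:8000/api/v1/search"


def build_search_payload(
    query: str,
    top_k: int = 10,
    use_reranking: bool = True,
    hybrid_alpha: float = 0.5,
) -> dict[str, Any]:
    """Build the JSON body for the search endpoint."""
    # Assumption: alpha must lie between 0.0 (BM25 only) and 1.0 (vector only).
    if not 0.0 <= hybrid_alpha <= 1.0:
        raise ValueError("hybrid_alpha must be between 0.0 and 1.0")
    return {
        "query": query,
        "top_k": top_k,
        "use_reranking": use_reranking,
        "hybrid_alpha": hybrid_alpha,
    }


def search(query: str, url: str = SEARCH_URL, timeout: float = 30.0, **options: Any) -> dict[str, Any]:
    """POST a search request and return the decoded JSON response."""
    body = json.dumps(build_search_payload(query, **options)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the API running locally, `search("How does hybrid search work?", top_k=5)["results"]` should yield a list of result objects shaped like the response above.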

Configuration

Configuration is managed through environment variables (see .env.example):

OPENAI_API_KEY: OpenAI API key for embeddings and re-ranking
DEFAULT_LLM_MODEL: LLM model to use (default: gpt-4o-mini)
EMBEDDING_MODEL: Embedding model (default: text-embedding-3-small)
DEFAULT_TOP_K: Number of results to return (default: 10)
DEFAULT_HYBRID_ALPHA: Balance between BM25 (0.0) and vector (1.0) search
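
As a rough illustration of how these variables might be consumed, the sketch below reads them into a settings object and shows one common way to interpret the alpha weight, a convex combination of the two scores. This is not the code in `src/core/`. The names `Settings`, `load_settings`, and `blend_scores` are hypothetical, the 0.5 fallback for `DEFAULT_HYBRID_ALPHA` is an assumption (no default is stated above), and both input scores are assumed to be normalized to the same range.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    openai_api_key: str
    default_llm_model: str
    embedding_model: str
    default_top_k: int
    default_hybrid_alpha: float


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read settings from the environment, falling back to documented defaults."""
    env = os.environ if env is None else env
    return Settings(
        openai_api_key=env.get("OPENAI_API_KEY", ""),
        default_llm_model=env.get("DEFAULT_LLM_MODEL", "gpt-4o-mini"),
        embedding_model=env.get("EMBEDDING_MODEL", "text-embedding-3-small"),
        default_top_k=int(env.get("DEFAULT_TOP_K", "10")),
        # Assumption: 0.5 is used here only because no default is documented.
        default_hybrid_alpha=float(env.get("DEFAULT_HYBRID_ALPHA", "0.5")),
    )


def blend_scores(bm25_score: float, vector_score: float, alpha: float) -> float:
    """Weighted mix: alpha=0.0 is pure BM25, alpha=1.0 is pure vector."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0.0 and 1.0")
    return (1.0 - alpha) * bm25_score + alpha * vector_score
```

Passing a plain dictionary to `load_settings` instead of relying on `os.environ` keeps the function easy to exercise in tests.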

Development

Code Formatting

```
# Format code
uv run black src tests

# Lint
uv run ruff check src tests
```

Adding Dependencies

```
# Add a runtime dependency
uv add package-name

# Add a development-only dependency
uv add --dev package-name
```

Scripts

scripts/setup_indices.py: Initialize search indices
scripts/run_benchmark.py: Run evaluation benchmarks

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes with tests
Run the test suite
Submit a pull request

License

MIT License - see LICENSE file for details

Resources

Research papers and documentation can be found in the resources/ directory.
