Helix fuses LightRAG's proven dual-level retrieval with Graphiti's bi-temporal Knowledge Graph to create a next-generation RAG system with:
| Feature | Capability |
|---|---|
| Temporal Awareness | Point-in-time queries, automatic edge invalidation |
| Multi-Hop Reasoning | BFS-based path exploration with scoring |
| Hallucination Detection | Composite Fidelity Index (CFI) verification |
| Incremental Updates | No full graph rebuild required |
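To make the temporal-awareness row concrete, here is a minimal illustrative sketch of bi-temporal edge invalidation — hypothetical names and toy in-memory storage, not Helix's or Graphiti's actual data model. Each edge carries a validity interval; ingesting a contradicting fact closes the old edge's interval instead of deleting it, and point-in-time queries filter on that interval:

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

# Illustrative only: Helix delegates this to Graphiti's bi-temporal model.
@dataclass
class Edge:
    subject: str
    predicate: str
    obj: str
    valid_at: datetime                     # when the fact became true
    invalid_at: Optional[datetime] = None  # set when a newer fact supersedes it

class TemporalEdgeStore:
    def __init__(self):
        self.edges: list[Edge] = []

    def insert(self, edge: Edge) -> None:
        # Automatic invalidation: a new fact for the same (subject, predicate)
        # closes the validity interval of the previous one.
        for old in self.edges:
            if (old.subject, old.predicate) == (edge.subject, edge.predicate) \
                    and old.invalid_at is None:
                old.invalid_at = edge.valid_at
        self.edges.append(edge)

    def at(self, when: datetime) -> list[Edge]:
        # Point-in-time query: edges whose interval contains `when`.
        return [e for e in self.edges
                if e.valid_at <= when
                and (e.invalid_at is None or when < e.invalid_at)]

store = TemporalEdgeStore()
store.insert(Edge("Apple", "ceo", "Steve Jobs", datetime(1997, 9, 16)))
store.insert(Edge("Apple", "ceo", "Tim Cook", datetime(2011, 8, 24)))

print([e.obj for e in store.at(datetime(2005, 1, 1))])  # ['Steve Jobs']
print([e.obj for e in store.at(datetime(2015, 1, 1))])  # ['Tim Cook']
```

Because old edges are invalidated rather than deleted, updates stay incremental: no full graph rebuild is required when facts change.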
| Category | Datasets | Metrics | Target | Baseline |
|---|---|---|---|---|
| Temporal | Time-LongQA, ECT-QA, MultiTQ | Hit@1, Hit@5, Acc | 70-75% | 45-55% |
| Hallucination | Legal QA, Medical QA, FEVER | AUC, CFI | >0.95 | 0.84-0.94 |
| Multi-Hop | MuSiQue, 2WikiMHQA, HotpotQA | F1, EM | 70-75 | 54-59 |
| Scalability | UltraDomain (all) | Tokens, Latency | <600K | 14M |
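For reference, the metrics in the table follow their standard definitions. A small self-contained sketch (not Helix's evaluation code) of Hit@k, exact match, and token-level F1:

```python
def hit_at_k(ranked_answers: list[str], gold: str, k: int) -> float:
    """1.0 if the gold answer appears among the top-k ranked candidates."""
    return 1.0 if gold in ranked_answers[:k] else 0.0

def exact_match(prediction: str, gold: str) -> float:
    """EM: normalized string equality."""
    return 1.0 if prediction.strip().lower() == gold.strip().lower() else 0.0

def f1(prediction: str, gold: str) -> float:
    """Token-level F1, as used by HotpotQA/MuSiQue-style benchmarks."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    gold_left = gold_tokens.copy()
    common = 0
    for t in pred_tokens:
        if t in gold_left:
            common += 1
            gold_left.remove(t)
    if common == 0:
        return 0.0
    precision = common / len(pred_tokens)
    recall = common / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(hit_at_k(["1912", "1954", "1936"], "1912", k=1))  # 1.0
print(round(f1("born in June 1912", "June 1912"), 3))   # 0.667
```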
Install from PyPI:

```bash
pip install helix-rag
```

Or install from source:

```bash
git clone https://github.com/YashNuhash/Helix.git
cd Helix

# Install with Helix dependencies
pip install -e ".[helix]"
```

Helix requires:
- Neo4j (for Graphiti Knowledge Graph)
- Supabase (optional, for vector storage)
- LLM API (any provider - configured via environment)
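If you don't have a Neo4j instance yet, a local one for development can be started with the official Neo4j Docker image (adjust the password to match your `.env`):

```shell
# Bolt on 7687 (what Helix connects to), browser UI on 7474
docker run -d --name helix-neo4j \
  -p 7474:7474 -p 7687:7687 \
  -e NEO4J_AUTH=neo4j/your_password \
  neo4j:5
```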
Copy `.env.example` to `.env` and configure:

```bash
cp .env.example .env
```

Example `.env`:

```bash
# Neo4j Configuration (for Graphiti)
NEO4J_URI=bolt://localhost:7687
NEO4J_USERNAME=neo4j
NEO4J_PASSWORD=your_password

# LLM Configuration (model-agnostic)
LLM_MODEL_NAME=your_model_name
LLM_API_KEY=your_api_key

# Supabase (optional)
SUPABASE_URL=https://your-project.supabase.co
SUPABASE_KEY=your_key
```

If you use Supabase, run `scripts/supabase_schema.sql` in your Supabase SQL Editor to create the vector storage table.
```python
import asyncio

from helix import Helix


async def main():
    # Initialize Helix
    async with Helix() as helix:
        # Insert a document with temporal tracking
        result = await helix.insert(
            "Alan Turing was born on June 23, 1912. "
            "He is considered the father of computer science.",
            source_description="Wikipedia"
        )
        print(f"Extracted {result['entities_extracted']} entities")

        # Query with temporal awareness
        answer = await helix.query(
            "When was Alan Turing born?",
            mode="hybrid"
        )
        print(answer["answer"])

asyncio.run(main())
```

Temporal query example:

```python
import asyncio
from datetime import datetime

from helix import Helix
from helix.utils import is_temporal_query, extract_temporal_params


async def temporal_example():
    async with Helix() as helix:
        # Detect temporal intent
        query = "Who was the CEO of Apple in 2015?"
        if is_temporal_query(query):
            params = extract_temporal_params(query)
            print(f"Temporal query detected: {params.temporal_keywords}")

        # Query with point-in-time context
        result = await helix.query(
            query,
            valid_at=datetime(2015, 1, 1),
            include_temporal_context=True
        )
        print(result)

asyncio.run(temporal_example())
```

Hallucination detection example:

```python
import asyncio

from helix import Helix
from helix.hallucination import HallucinationDetector


async def verify_response():
    async with Helix() as helix:
        detector = HallucinationDetector(graphiti=helix.graphiti)

        # Get a response
        result = await helix.query("Tell me about Alan Turing")

        # Verify it against the knowledge graph
        verification = await detector.verify_response(
            response=result["answer"],
            query="Tell me about Alan Turing",
            context=result.get("temporal_context")
        )
        print(f"Grounded: {verification.is_grounded}")
        print(f"CFI Score: {verification.confidence_score:.2f}")
        print(f"Entity Coverage: {verification.entity_coverage:.2%}")

asyncio.run(verify_response())
```

Multi-hop reasoning example:

```python
import asyncio

from helix import Helix
from helix.multihop import MultiHopRetriever


async def multihop_example():
    async with Helix() as helix:
        retriever = MultiHopRetriever(graphiti=helix.graphiti)

        # Find reasoning paths
        paths = await retriever.find_paths(
            query="How is Alan Turing connected to modern AI?",
            max_hops=3
        )

        # Format the paths as LLM context
        context = retriever.format_paths_as_context(paths)
        print(context)

asyncio.run(multihop_example())
```

Helix includes evaluation scripts for academic benchmarks. Use them in Google Colab or Kaggle:

```python
# Install Helix
!pip install helix-rag

# Run the temporal benchmark
from helix.eval import TemporalBenchmark

benchmark = TemporalBenchmark(dataset="time-longqa")
results = await benchmark.run()  # top-level await works in Colab/Jupyter
print(f"Hit@1: {results['hit_at_1']:.2%}")
```

| Benchmark | Dataset | Command |
|---|---|---|
| Temporal | Time-LongQA | `helix eval --dataset time-longqa` |
| Temporal | ECT-QA | `helix eval --dataset ect-qa` |
| Multi-Hop | MuSiQue | `helix eval --dataset musique` |
| Multi-Hop | HotpotQA | `helix eval --dataset hotpotqa` |
| Hallucination | FEVER | `helix eval --dataset fever` |
| Scalability | UltraDomain | `helix eval --dataset ultradomain` |

```python
# Quick evaluation notebook
import os

os.environ["LLM_API_KEY"] = "your_key"
os.environ["LLM_MODEL_NAME"] = "your_model"
os.environ["NEO4J_URI"] = "bolt://localhost:7687"
os.environ["NEO4J_PASSWORD"] = "password"

from helix import Helix
from helix.eval import run_all_benchmarks

# Run all benchmarks
results = await run_all_benchmarks()
print(results.to_dataframe())
```

Architecture overview:

```
┌───────────────────────────────────────────────────────────────┐
│                             Helix                             │
├───────────────────────────────────────────────────────────────┤
│  ┌─────────────┐   ┌──────────────┐   ┌───────────────────┐   │
│  │  LightRAG   │   │   Graphiti   │   │   Helix Modules   │   │
│  │ (Retrieval) │   │ (Temporal KG)│   │                   │   │
│  ├─────────────┤   ├──────────────┤   ├───────────────────┤   │
│  │ - Chunking  │   │ - Episodes   │   │ - TemporalHandler │   │
│  │ - Embedding │   │ - Bi-temporal│   │ - Hallucination   │   │
│  │ - Vector DB │   │ - Resolution │   │ - MultiHop        │   │
│  │ - Dual-level│   │ - Invalidate │   │ - CFI Scoring     │   │
│  └──────┬──────┘   └──────┬───────┘   └─────────┬─────────┘   │
│         │                 │                     │             │
│         └─────────────────┼─────────────────────┘             │
│                           ▼                                   │
│  ┌─────────────────────────────────────────────────────────┐  │
│  │                      Storage Layer                      │  │
│  ├─────────────────────────────────────────────────────────┤  │
│  │    Neo4j (Graph)  │  Supabase (Vector)  │  Local KV     │  │
│  └─────────────────────────────────────────────────────────┘  │
└───────────────────────────────────────────────────────────────┘
```
```
helix/
├── __init__.py              # Package entry (v0.1.1)
├── core/
│   └── helix.py             # Main Helix class
├── storage/
│   ├── graphiti_impl.py     # GraphitiGraphStorage
│   └── supabase_impl.py     # SupabaseVectorStorage
├── temporal/
│   └── query_handler.py     # TemporalQueryHandler
├── hallucination/
│   └── detector.py          # HallucinationDetector (CFI)
├── multihop/
│   └── retriever.py         # MultiHopRetriever (BFS)
└── utils/
    └── temporal_utils.py    # Temporal parsing
```
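The BFS-based path exploration in `multihop/retriever.py` can be pictured with a small sketch — hypothetical toy graph and scoring, not the actual `MultiHopRetriever` implementation. Breadth-first search enumerates bounded-length relation paths between entities, then the paths are scored (here, simply preferring shorter chains) before being formatted as context:

```python
from collections import deque

# Toy knowledge graph: entity -> [(relation, neighbor), ...]
GRAPH = {
    "Alan Turing": [("founded_field", "Computer Science")],
    "Computer Science": [("includes", "Machine Learning")],
    "Machine Learning": [("drives", "Modern AI")],
}

def find_paths(start: str, goal: str, max_hops: int = 3):
    """BFS over the graph, returning all relation paths of length <= max_hops."""
    paths = []
    queue = deque([(start, [])])  # (current entity, path of (src, rel, dst))
    while queue:
        node, path = queue.popleft()
        if node == goal and path:
            paths.append(path)
            continue
        if len(path) >= max_hops:  # the hop bound also guarantees termination
            continue
        for rel, nxt in GRAPH.get(node, []):
            queue.append((nxt, path + [(node, rel, nxt)]))
    # Score: prefer shorter reasoning chains.
    return sorted(paths, key=len)

for path in find_paths("Alan Turing", "Modern AI"):
    print(" | ".join(f"{s} -[{r}]-> {d}" for s, r, d in path))
```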
Helix is designed to achieve state-of-the-art performance on:
- Temporal GraphRAG: 70-75% accuracy on temporal QA benchmarks
- Hallucination Detection: AUC >0.95 using graph-aligned verification
- Multi-Hop Reasoning: F1 70-75 on complex reasoning benchmarks
- Scalability: <600K tokens for indexing (vs 14M baseline)
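The AUC target above is the standard ranking statistic: the probability that a randomly chosen grounded response receives a higher grounding score than a randomly chosen hallucinated one. A minimal sketch with hypothetical CFI scores (not Helix's detector output):

```python
def auc(grounded_scores: list[float], hallucinated_scores: list[float]) -> float:
    """Probability that a grounded response outscores a hallucinated one
    (ties count half) -- equivalent to ROC-AUC for a score-threshold detector."""
    wins = 0.0
    for g in grounded_scores:
        for h in hallucinated_scores:
            if g > h:
                wins += 1.0
            elif g == h:
                wins += 0.5
    return wins / (len(grounded_scores) * len(hallucinated_scores))

# Hypothetical CFI scores from a verification run
grounded = [0.91, 0.88, 0.97, 0.85]
hallucinated = [0.40, 0.62, 0.30]
print(f"AUC: {auc(grounded, hallucinated):.2f}")  # AUC: 1.00
```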
See PLAN.md for detailed research methodology.
If you use Helix in your research, please cite:

```bibtex
@software{helix2024,
  title  = {Helix: Temporal GraphRAG with LightRAG and Graphiti},
  author = {Your Name},
  year   = {2024},
  url    = {https://github.com/YashNuhash/Helix}
}
```

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.
MIT License - see LICENSE for details.
Built with 🧬 Helix
LightRAG + Graphiti = Temporal GraphRAG