RAGGuard

Detect hallucinations in RAG pipelines — verify that LLM responses are faithfully grounded in retrieved source documents.

How It Works

flowchart LR
    A[LLM Response] --> B[Tokenize & Split]
    C[Source Context] --> D[Tokenize & Split]
    B --> E[Token Overlap]
    B --> F[TF-IDF Vectors]
    D --> F
    E --> G[Weighted Score]
    F --> H[Cosine Similarity]
    H --> G
    G --> I{GroundednessScore}
    I -->|>= 0.70| J[✅ Grounded]
    I -->|0.40 – 0.70| K[⚠️ Partially Grounded]
    I -->|< 0.40| L[❌ Hallucinated]

Quickstart

Install

pip install git+https://github.com/MukundaKatta/RAGGuard.git

Python API

from ragguard import HallucinationDetector, generate_faithfulness_report

detector = HallucinationDetector()

context = "Python was created by Guido van Rossum and released in 1991."
response = "Python was created by Guido van Rossum in 1991."

result = detector.score(response, context)
print(result.score)   # 0.95
print(result.label)   # "grounded"

# Detailed per-sentence report
report = generate_faithfulness_report(response, context)
for s in report.sentence_scores:
    print(f"  {s.sentence[:50]}  →  {s.score:.0%}  {'✅' if s.grounded else '❌'}")

CLI

ragguard check \
  --response "Python was released in 1991." \
  --context "Python was created by Guido van Rossum and first released in 1991."

# Detailed breakdown
ragguard check -r "..." -c "..." --detailed

Fact Checking

from ragguard import FactChecker

checker = FactChecker()
results = checker.check(
    response="Python was released in 1991. It was created by Linus Torvalds.",
    source_documents=["Python was created by Guido van Rossum and released in 1991."],
)
for r in results:
    print(f"  {r['claim'][:50]}  →  grounded={r['grounded']}")

Citation Verification

from ragguard import CitationVerifier

verifier = CitationVerifier()
results = verifier.verify([
    {"claim": "Python was released in 1991.", "source": "Python first appeared in 1991."},
])
print(results[0].supported)  # True

Features

Feature	Description
HallucinationDetector	Combined token-overlap + TF-IDF semantic scoring
FactChecker	Extract claims and verify each against source docs
CitationVerifier	Check that cited sources support their claims
FaithfulnessReport	Per-sentence grounding breakdown
CLI	Quick checks from the command line with Rich output
Zero ML dependencies	Pure Python — no torch, no transformers

Configuration

from ragguard import Settings

settings = Settings()
settings.weights.token_overlap = 0.35
settings.weights.semantic_similarity = 0.65
settings.thresholds.grounded = 0.70

Running Tests

pip install pytest
python -m pytest tests/ -v

License

MIT — see LICENSE.

Built by Officethree Technologies | Made with ❤️ and AI

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
docs		docs
src/ragguard		src/ragguard
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGGuard

How It Works

Quickstart

Install

Python API

CLI

Fact Checking

Citation Verification

Features

Configuration

Running Tests

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAGGuard

How It Works

Quickstart

Install

Python API

CLI

Fact Checking

Citation Verification

Features

Configuration

Running Tests

License

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages