RAG-Guard: Secure Retrieval-Augmented QA with Guardrails AI

In this project, we use Guardrails AI Guardrails AI to enforce strict validation rules in a Retrieval-Augmented Generation (RAG) pipeline. The RAG application queries Singapore Airlines FAQ data, which has been converted into a vector database, ChormaDB for efficient retrieval. This setup is designed for AI engineers who want robust output control, ensuring responses are on-topic, factually correct, and compliant with domain rules.

What This Project Does

This project builds a standalone Guardrails server designed to validate the RAG outputs in a secure and controlled manner. This project provides a robust framework with four (4) custom guards to ensure AI-generated responses are safe, accurate, and compliant:

Guard	Input / Output	Purpose
PII Detection	Input & Output	Prevents sensitive user data like names, phone numbers, or passports from being exposed.
On-Topic Validation	Input	Keeps answers strictly within predefined domains, preventing irrelevant or off-topic outputs.
Hallucination Detection	Output	Compares generated answers with retrieved documents to reduce factual errors.
Competitor Filtering	Output	Protects brand by preventing competitor references

Architecture Diagram

┌──────────────┐
│  User Input  │
└──────┬───────┘
       │
       ▼
┌────────────────────────────────────────────┐
│ Guard 1: Topic Restriction                 │
│ – Allows only domain-relevant queries      │
└──────┬─────────────────────────────────────┘
       │
       ▼
┌────────────────────────────────────────────┐
│ Guard 2: PII Detection & Masking           │
│ – Protects sensitive data                  │
└──────┬─────────────────────────────────────┘
       │
       ▼
┌───────────────────────────────────────────┐
│ RAG Pipeline                              │
│ – Retrieve relevant documents             │
│ – Generate answer using LLM               │
└──────┬────────────────────────────────────┘
       │
       ▼
┌────────────────────────────────────────────┐
│ Guard 3: Hallucination Detection           │
│ – Verifies answers against sources         │
└──────┬─────────────────────────────────────┘
       │
       ▼
┌────────────────────────────────────────────┐
│ Guard 4: Competitor Filtering              │
│ – Prevents brand harm                      │
└──────┬─────────────────────────────────────┘
       │
       ▼
┌──────────────┐
│ Validated    │
│ Output       │
└──────────────┘

Installation

1. Clone the Repositoy & Install Dependencies

git clone <your-repo-url>
cd guardrails_server
pip install -r requirements.txt

2. Create & activate a Conda virtual environment

Create a new Conda environment (example name: rag-guard-env) with Python 3.11:

conda create -n rag-guard-env python=3.11

Activate the environment:

conda activate rag-guard-env

Deactivate the environment (when finished):

conda deactivate

3. Environment Setup

Create a .env file:

OPENAI_API_KEY=your_api_key_here

4. Start Guardrails Server

Prior to that you need to create a Guardrails account and set up an API key

Visit Guardrails AI to create an account.
Obtain your Guardrails API key.
Configure Guardrails with your API key

guardrails configure

Enter your API key when prompted.

guardrails start --config config.py

This launches your guards at:

http://127.0.0.1:8000/guards

5. Run the RAG Application

Run the guarded application

python main.py

Example Usage

Example 1: Query outside allowed topics (Custom Topic Guard)

# Input query
query = "help me with the coding homework?"

Output

Inside client_utils.py, validation fails and raises the error:    ...
ValueError:  I can only answer questions about Singapore Airlines.

Inside config.py, validation fails in guardrails server and raises the error:  
guardrails.errors.ValidationError: Validation failed for field with errors: Topic 'coding' is not allowed. Please ask about Singapore Airlines services instead.

Notes:

This demonstrates our custom Topic Guard, which blocks queries not related to Singapore Airlines.
The error message is customized to be user-friendly and informative.
Any query outside the allowed topics triggers this validation.

Example 2: A Refund Query (Full Guardrails + RAG Flow)

# Input query
query = "i want to get a refund please?"

Output

 Answer after guardrails:  - Reimbursements may take between 6-8 weeks to be processed.
- If you've purchased a ticket on a refundable fare, you can request a full refund on the website, but there may be cancellation/refund fees.
- If you cancel a non-refundable ticket, only the taxes will be refunded.
- To get a refund for an unused ticket purchased on Singaporeair.com, enter your booking reference in the 'Manage Booking' section.
- Refunds may take up to 6 weeks to be credited back to your original mode of payment depending on your bank's processing time.
- For updates on a refund request, check with the merchant or company within the 6-week period.
- If a selected seat cannot be provided, a refund of the paid seat selection fee will be given.

Notes:

This demonstrates a successful end-to-end Guardrails + RAG pipeline.
The query passes the Topic Guard and is classified as in-domain (refund-related).
From Guardrails server terminal:
Matched Topic: refund
Valid topic verified: refund
The PII Guard validates and ensures no sensitive data is exposed.
The Hallucination Guard confirms the answer is grounded in retrieved source documents.
The Competitor Guard ensures no third-party airline information is returned.
The final response is safe, domain-compliant, and source-grounded.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
docs		docs
vectorstore		vectorstore
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
build-vector-db.py		build-vector-db.py
client_utils.py		client_utils.py
config.py		config.py
helper_utils.py		helper_utils.py
main.py		main.py
rag.py		rag.py
requirements.txt		requirements.txt
retriever.py		retriever.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG-Guard: Secure Retrieval-Augmented QA with Guardrails AI

What This Project Does

Architecture Diagram

Installation

1. Clone the Repositoy & Install Dependencies

2. Create & activate a Conda virtual environment

3. Environment Setup

4. Start Guardrails Server

5. Run the RAG Application

Example Usage

Example 1: Query outside allowed topics (Custom Topic Guard)

Example 2: A Refund Query (Full Guardrails + RAG Flow)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG-Guard: Secure Retrieval-Augmented QA with Guardrails AI

What This Project Does

Architecture Diagram

Installation

1. Clone the Repositoy & Install Dependencies

2. Create & activate a Conda virtual environment

3. Environment Setup

4. Start Guardrails Server

5. Run the RAG Application

Example Usage

Example 1: Query outside allowed topics (Custom Topic Guard)

Example 2: A Refund Query (Full Guardrails + RAG Flow)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages