A clean, beginner-friendly implementation of a Retrieval-Augmented Generation (RAG) system for document-based question answering.
This project demonstrates how modern AI systems retrieve relevant information from documents and use a Large Language Model (LLM) to generate accurate, grounded answers instead of guessing. It is designed for learning, experimentation, and portfolio use.
This project is officially published on Ready Tensor.
This project helps you understand how RAG works in practice. It is useful if you want to:
- Learn Retrieval-Augmented Generation (RAG)
- Understand document-based question answering
- Build a portfolio project for jobs or internships
- Experiment with LLMs safely
- Create a Google Colab or Kaggle demo
The assistant answers questions only from uploaded documents, reducing hallucinations and improving reliability.
- Load documents in common formats: `.txt`, `.pdf`, `.docx`
- Split documents into meaningful chunks
- Convert text into vector embeddings
- Store embeddings in a vector database
- Retrieve the most relevant content for a query
- Generate answers grounded in retrieved context
- Switch between LLM providers without changing core logic
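To make the chunking step concrete, here is a minimal overlapping splitter in plain Python. This is a simplified sketch: the project itself would typically use LangChain's text splitters, and the chunk size and overlap values below are arbitrary defaults.

```python
def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks.

    The overlap keeps context that straddles a chunk boundary
    available in both neighbouring chunks, so a relevant sentence
    is never lost to an unlucky split point.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks
```

Each chunk repeats the last `overlap` characters of the previous one; production splitters usually split on sentence or token boundaries instead of raw characters.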
- 📄 Document ingestion
- 🔍 Semantic search using embeddings
- 🤖 Retrieval-Augmented Generation
- 🌍 Multiple LLM providers
- 🔒 Secure by design (no hardcoded API keys)
- 🧪 Ideal for experimentation and learning
- 📦 Works smoothly in Google Colab and Kaggle
The system supports multiple LLM APIs:
- OpenAI
- Groq
- Google Gemini
The provider can be selected at runtime without modifying the RAG pipeline.
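One common way to achieve this runtime selection is a small dispatch layer that maps a provider name to a single chat-completion callable, so the rest of the pipeline never touches provider-specific code. The wrapper functions below are hypothetical stand-ins; the real project would plug in the actual OpenAI, Groq, and Gemini clients behind the same interface.

```python
from typing import Callable

# Hypothetical wrappers; each would call the real provider's SDK.
def _ask_openai(prompt: str) -> str:
    return f"[openai] {prompt}"

def _ask_groq(prompt: str) -> str:
    return f"[groq] {prompt}"

def _ask_gemini(prompt: str) -> str:
    return f"[gemini] {prompt}"

PROVIDERS: dict[str, Callable[[str], str]] = {
    "openai": _ask_openai,
    "groq": _ask_groq,
    "gemini": _ask_gemini,
}

def get_llm(provider: str) -> Callable[[str], str]:
    """Return the chat function for the chosen provider name."""
    try:
        return PROVIDERS[provider]
    except KeyError:
        raise ValueError(f"Unknown provider: {provider!r}") from None
```

Because the pipeline only ever calls the returned function, swapping providers is a one-line change at startup.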
- Documents are loaded from a `data/` folder
- Text is split into overlapping chunks
- Embeddings are created for each chunk
- Embeddings are stored in a vector database (ChromaDB)
- A user question retrieves the most relevant chunks
- The LLM generates an answer only from retrieved content
This ensures:
- Reduced hallucinations
- Transparent reasoning
- Document-grounded answers
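The retrieval step above can be illustrated with a deliberately simplified stand-in for the embedding and vector-store stages: chunks are "embedded" as word-count vectors and ranked by cosine similarity. The real project uses sentence-transformers embeddings and ChromaDB instead, so treat this only as a sketch of the ranking idea.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy "embedding": a bag-of-words count vector.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]
```

The retrieved chunks are then placed in the LLM prompt, which is what keeps the answer grounded in the documents rather than in the model's prior knowledge.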
Visit https://colab.research.google.com
Open the provided notebook or upload RAG_Implementation_v2.ipynb.
All required dependencies are listed in requirements.txt.
Run once:

`!pip install -q -r requirements.txt`

- Create a folder named `data/`
- Upload your `.txt`, `.pdf`, or `.docx` files using the file panel
Supported providers:
- OpenAI
- Groq
- Google Gemini
Provide API keys securely using environment variables (never hardcode keys).
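In a notebook, a key can be read from an environment variable and prompted for interactively only when absent, so it never appears in the code or the saved notebook. A typical pattern (assuming the variable name `OPENAI_API_KEY`; substitute the variable for your provider) looks like:

```python
import os
from getpass import getpass

def load_api_key(var_name: str = "OPENAI_API_KEY") -> str:
    """Read an API key from the environment, prompting only if absent."""
    key = os.environ.get(var_name)
    if not key:
        key = getpass(f"Enter {var_name}: ")  # hidden input in Colab/Jupyter
        os.environ[var_name] = key  # cache for the rest of the session
    return key
```

`getpass` hides the typed key, and caching it in `os.environ` means later cells (and libraries that read the variable themselves) pick it up automatically.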
After documents are indexed, ask questions like:
- “What is NLP?”
- “Explain embeddings from the documents”
- “Summarize the uploaded files”
The assistant responds only using your documents.
- Not optimized for large-scale production
- Requires valid API keys
- Model availability may change
- Designed mainly for educational and portfolio use
This project is shared for educational and personal use.
This project uses open-source tools such as LangChain, ChromaDB, and sentence-transformers to demonstrate Retrieval-Augmented Generation in a simple and understandable way.
