Trade-portfolio Chatbot

A natural language chatbot for querying portfolio holdings and trades data using advanced RAG (Retrieval-Augmented Generation) architecture for dynamic SQL generation.

Why RAG-Enhanced Dynamic SQL Generation Architecture?

We implemented RAG-Enhanced Dynamic SQL Generation architecture for these key reasons:

Dynamic SQL: Generates queries in real-time works with any schema without hardcoding.
Context-Aware: Handles complex, unseen questions by understanding schema relationships.
RAG-Driven: Retrieves similar valid SQL examples to improve generation accuracy.
Secure: Implements multi-layer SQL injection prevention.
Self-Learning: Automatically feeds successful queries back into the system to improve future performance.

Technical Stack

Component	Technology	Purpose
LLM	Groq Llama 3.3 70B	Natural language to SQL conversion
Embeddings	SentenceTransformers (all-MiniLM-L6-v2)	Convert text to vectors for RAG
Vector Store	FAISS	Fast similarity search for SQL examples
Database	SQLite	Store and query CSV data
Framework	LangChain	Orchestrate LLM and database interactions
Data Processing	Pandas, NumPy	CSV manipulation and cleaning

🏗️ System Architecture Diagram

┌──────────────────────────────────────────────────────────────┐
│                         CSV FILES                            │
│              holdings.csv (1,022 rows × 25 cols)             │
│               trades.csv (649 rows × 31 cols)                │
└──────────────────┬───────────────────────────────────────────┘
                   │ Pandas ETL Pipeline
                   ↓
┌──────────────────────────────────────────────────────────────┐
│                   SQLITE DATABASE                            │
│  • Column name sanitization (remove spaces/special chars)    │
│  • Type inference (INTEGER, REAL, TEXT)                      │
│  • Indexed tables for fast queries                           │
└──────────────────┬───────────────────────────────────────────┘
                   │
        ┌──────────┴──────────┐
        ↓                     ↓
┌──────────────────┐  ┌──────────────────────────┐
│  SCHEMA EXTRACTOR│  │  SQL SAMPLE GENERATOR    │
│  • Table names   │  │  • Pattern detection     │
│  • Column types  │  │  • 5 base examples       │
│  • Relationships │  │  • Question templates    │
└────────┬─────────┘  └──────────┬───────────────┘
         │                       │
         │                       ↓
         │            ┌──────────────────────────┐
         │            │   SENTENCE TRANSFORMER   │
         │            │   • Encode to vectors    │
         │            │   • 384-dim embeddings   │
         │            └──────────┬───────────────┘
         │                       │
         │                       ↓
         │            ┌──────────────────────────┐
         │            │      FAISS INDEX         │
         │            │   • L2 distance search   │
         │            │   • Top-k retrieval      │
         │            │   • Self-learning updates│
         │            └──────────┬───────────────┘
         │                       │
         └───────────────────────┘
                   │
                   ↓
         ┌─────────────────────────┐
         │  PROMPT BUILDER         │
         │  Schema + Examples +    │
         │  Question → LLM Prompt  │
         └─────────┬───────────────┘
                   │
                   ↓
         ┌─────────────────────────┐
         │   GROQ LLM API          │
         │   Llama 3.3 70B         │
         │   Temperature: 0        │
         └─────────┬───────────────┘
                   │
                   ↓ SQL Query
         ┌─────────────────────────┐
         │  SQL GUARDRAILS         │
         │  • Starts with SELECT?  │
         │  • No forbidden words?  │
         │  • Auto-LIMIT injection │
         └─────────┬───────────────┘
                   │
                   ↓
         ┌─────────────────────────┐
         │  SQL EXECUTOR           │
         │  pandas.read_sql_query  │
         └─────────┬───────┬───────┘
                   │       │ (Success)
                   ↓       ↓
         ┌─────────┴───────┴───────┐
         │  SELF-LEARNING RAG      │
         │  • Stores valid SQL     │
         │  • Updates Vector Store │
         └─────────┬───────────────┘
                   │
                   ↓
         ┌─────────────────────────┐
         │  RESULT FORMATTER       │
         │  • Single value         │
         │  • Table (≤10 rows)     │
         │  • Preview (>10 rows)   │
         └─────────┬───────────────┘
                   │
                   ↓
              USER ANSWER

🚀 Quick Start

Option 1: Run with Docker (Recommended)

No Python installation needed. Just install Docker.

# 1. Clone the repo
git clone <repository_url>
cd <project_directory>

# 2. Add your API Key
# Create a .env file and add: GROQ_API_KEY=your_key_here

# 3. Run the App
docker-compose up --build

Then open the Jupyter link printed in the terminal.

Option 2: Run Manually (Local Python)

1. Setup Environment

# Windows
py -3.11 -m venv env311
.\env311\Scripts\activate

# Mac/Linux
python3.11 -m venv env311
source env311/bin/activate

2. Install Dependencies

pip install -r requirements.txt

3. Setup API Key Create a .env file in the project root:

GROQ_API_KEY=your_api_key_here

4. Run the Notebook

jupyter notebook trade_portfolio.ipynb

Performance Metrics

Database Size: 1,671 total records (1,022 holdings + 649 trades)
Schema Columns: 56 total columns across 2 tables
RAG Examples: 5 auto-generated SQL samples
Embedding Dimension: 384 (MiniLM model)
Query Latency: ~2-5 seconds (Groq API + SQL execution)
Accuracy: 100% on in-scope questions, 100% fallback on out-of-scope

How It Works

1. Data Ingestion

holdings_df = pd.read_csv("holdings.csv")  # 1,022 rows × 25 columns
trades_df = pd.read_csv("trades.csv")      # 649 rows × 31 columns

2. Schema Auto-Detection

schema_info = {
    'holdings': {
        'columns': ['AsOfDate', 'PortfolioName', 'MV_Base', 'PL_YTD', ...]
    },
    'trades': {
        'columns': ['TradeTypeName', 'SecurityId', 'Quantity', ...]
    }
}

3. SQL Sample Generation

sql_samples = [
    {
        "question": "How many total holdings are there?",
        "sql": "SELECT COUNT(*) FROM holdings;",
        "explanation": "Counts all rows"
    },
    # ... 4 more auto-generated samples
]

4. RAG Vector Store Creation

embedding_model = SentenceTransformer('all-MiniLM-L6-v2')
embeddings = embedding_model.encode([sample['question'] for sample in sql_samples])
index = faiss.IndexFlatL2(dimension=384)
index.add(embeddings)

5. Query Processing with Self-Learning

def querygpt_chatbot(question):
    # Step 1: Retrieve similar SQL examples (RAG)
    relevant_docs = retrieve_relevant_examples(question, top_k=2)
    
    # Step 2: Build prompt with schema + examples
    prompt = f"{schema_text}\n{examples}\nQUESTION: {question}"
    
    # Step 3: Generate SQL via LLM
    sql = llm.invoke(prompt).content.strip()
    
    # Step 4: SQL Guardrails (Safety + Auto-LIMIT)
    if not is_safe_sql(sql):  # Block DROP, DELETE, etc.
        return "Only SELECT queries allowed"
    sql = enforce_limit(sql, limit=100)  # Add LIMIT if missing
    
    # Step 5: Execute query
    result = pd.read_sql_query(sql, engine)
    
    # Step 6: Self-Learning RAG - Store successful queries
    add_success_to_rag(question, sql)  # Updates vector store dynamically
    
    # Step 7: Format and return results
    return format_result(result)

6. Future Optimizations

The following optimizations can be done when dealing with larger datasets, currently not executed due to the small dataset size.

1. Intent Classification Agent: Routes queries into COUNT / PNL / FILTER / TOP-N categories. Saves LLM tokens and reduces SQL retries.

2. Table Selection Agent: Selects only relevant tables before SQL generation. Reduces schema size and confusion.

3. Column Pruning Agent: Sends only relevant columns instead of full schemas. Major token cost savings in large databases.

4. Metadata Gateway: Dynamically injects schema, data types, statistics, and sample rows. Prevents hallucinated columns.

5. Retry with Schema Expansion: If SQL fails due to missing columns, retry with expanded schema.

6. Query Audit Logging: Logs question → SQL → execution time → status for monitoring.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
holdings.csv		holdings.csv
requirements.txt		requirements.txt
trade_portfolio.ipynb		trade_portfolio.ipynb
trades.csv		trades.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trade-portfolio Chatbot

Why RAG-Enhanced Dynamic SQL Generation Architecture?

Technical Stack

🏗️ System Architecture Diagram

🚀 Quick Start

Option 1: Run with Docker (Recommended)

Option 2: Run Manually (Local Python)

Performance Metrics

How It Works

1. Data Ingestion

2. Schema Auto-Detection

3. SQL Sample Generation

4. RAG Vector Store Creation

5. Query Processing with Self-Learning

6. Future Optimizations

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Trade-portfolio Chatbot

Why RAG-Enhanced Dynamic SQL Generation Architecture?

Technical Stack

🏗️ System Architecture Diagram

🚀 Quick Start

Option 1: Run with Docker (Recommended)

Option 2: Run Manually (Local Python)

Performance Metrics

How It Works

1. Data Ingestion

2. Schema Auto-Detection

3. SQL Sample Generation

4. RAG Vector Store Creation

5. Query Processing with Self-Learning

6. Future Optimizations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages