πŸ“ Smart Notes Application

A powerful Flask-based notes application with advanced hybrid search capabilities and an AI-powered chatbot using OpenSearch and DeepSeek Chat for intelligent note exploration.

✨ Features

  • πŸ“ Complete Note Management: Create, edit, delete, and organize your notes
  • πŸ” Hybrid Search: Semantic + lexical search powered by OpenSearch
  • πŸ€– RAG Chatbot: AI assistant that answers questions using your notes via DeepSeek Chat
  • 🏷️ Organization: Categories, tags, and color-coding for easy note organization
  • ⭐ Favorites: Mark important notes for quick access
  • πŸ—‘οΈ Soft Delete: Recover accidentally deleted notes
  • πŸ‘€ User Authentication: Secure login and registration system
  • πŸ’… Modern UI: Beautiful interface with Tailwind CSS and custom SCSS
  • πŸ’¬ Chat Sessions: Maintain conversation history with the AI chatbot

πŸ—οΈ Architecture

┌─────────────┐     ┌──────────────┐     ┌─────────────────┐
│   Browser   │────▶│ Flask App    │────▶│   SQLite DB     │
│  (Frontend) │     │  (Backend)   │     │  (Notes, Users) │
└─────────────┘     └──────────────┘     └─────────────────┘
                           │
                           ├──────────────▶┌─────────────────┐
                           │               │  OpenSearch     │
                           │               │  - Hybrid Search│
                           │               │  - Embeddings   │
                           │               └─────────────────┘
                           │
                           └──────────────▶┌─────────────────┐
                                           │  DeepSeek API   │
                                           │  - RAG Chatbot  │
                                           └─────────────────┘

🚀 Tech Stack

Backend

  • Flask 3.1.2 - Web framework
  • SQLAlchemy - ORM for database operations
  • SQLite - Lightweight database
  • Flask-Login - User session management

Search & AI

  • OpenSearch 3.4.0 - Hybrid search engine with ML capabilities
  • Sentence Transformers - Text embeddings (all-MiniLM-L6-v2)
  • DeepSeek Chat API - RAG-powered conversational AI

Frontend

  • Tailwind CSS - Utility-first CSS framework
  • SCSS - CSS preprocessing
  • Vanilla JavaScript - Interactive UI components

DevOps

  • Docker & Docker Compose - Containerization
  • Node.js & npm - Build tools

📋 Prerequisites

Ensure you have the following installed:

  • Python 3.8 or higher
  • Node.js 14.x or higher
  • Docker and Docker Compose
  • npm or yarn
  • DeepSeek API key (get one from the DeepSeek API platform)

πŸ› οΈ Installation

1. Clone the Repository

git clone https://github.com/IsmailKattan/Notebook.git
cd Notebook

2. Create Virtual Environment

Windows:

python -m venv env
env\Scripts\activate

macOS/Linux:

python3 -m venv env
source env/bin/activate

3. Install Python Dependencies

pip install -r requirements.txt

4. Install Node.js Dependencies

npm install

5. Set Up Environment Variables

Create a .env file in the project root:

# Flask Configuration
SECRET_KEY=your-secret-key-here-change-in-production
SQLALCHEMY_DATABASE_URI=sqlite:///notebook.db

# OpenSearch Configuration
OPENSEARCH_HOST=localhost
OPENSEARCH_PORT=9200

# DeepSeek Configuration
DEEPSEEK_API_KEY=sk-your-deepseek-api-key-here
DEEPSEEK_MODEL=deepseek-chat

# Chatbot Configuration
MAX_CONTEXT_NOTES=5
MAX_CONTEXT_LENGTH=2000

6. Start OpenSearch

# Start OpenSearch in detached mode
docker-compose up -d

# Verify OpenSearch is running
curl http://localhost:9200

Wait 1-2 minutes for OpenSearch to fully initialize.

βš™οΈ OpenSearch Configuration

You have two options for configuring OpenSearch:

Option A: Automated Setup (Recommended)

Use the provided Python script to automatically configure everything:

# Make sure your virtual environment is activated
# Set your DeepSeek API key in environment
export DEEPSEEK_API_KEY=sk-your-deepseek-api-key-here  # Linux/macOS
# or
set DEEPSEEK_API_KEY=sk-your-deepseek-api-key-here     # Windows CMD
# or
$env:DEEPSEEK_API_KEY="sk-your-deepseek-api-key-here"  # Windows PowerShell

# Run the setup script
python opensearch_setup.py

The script will:

  • ✅ Configure all cluster settings
  • ✅ Register model groups and models
  • ✅ Create pipelines and indexes
  • ✅ Set up the DeepSeek connector
  • ✅ Create the RAG search pipeline
  • ✅ Output all environment variables for your .env file

Copy the generated environment variables to your .env file!

Option B: Manual Setup with cURL

If you prefer manual control or want to understand each step, run the following requests in order once OpenSearch is up:

Step 1: Configure ML Commons Settings

curl --location --request PUT 'http://localhost:9200/_cluster/settings' \
--header 'Content-Type: application/json' \
--data '{
  "persistent": {
    "plugins.ml_commons.allow_registering_model_via_url": "true",
    "plugins.ml_commons.only_run_on_ml_node": "false",
    "plugins.ml_commons.model_access_control_enabled": "true",
    "plugins.ml_commons.native_memory_threshold": "99"
  }
}'

Step 2: Register Model Group

curl --location 'localhost:9200/_plugins/_ml/model_groups/_register' \
--header 'Content-Type: application/json' \
--data '{
  "name": "note_search_with_highlighter",
  "description": "Models for note search with highlighter"
}'

πŸ“ Save the model_group_id from the response (e.g., aHkiSpsBv7PT9JWEQJcl)

Step 3: Register Sentence Transformer Model

curl --location 'http://localhost:9200/_plugins/_ml/models/_register' \
--header 'Content-Type: application/json' \
--data '{
  "name": "huggingface/sentence-transformers/all-MiniLM-L6-v2",
  "version": "1.0.1",
  "model_format": "TORCH_SCRIPT",
  "model_group": "aHkiSpsBv7PT9JWEQJcl"
}'

Replace aHkiSpsBv7PT9JWEQJcl with your model_group_id.

πŸ“ Save the task_id from the response (e.g., bXkjSpsBv7PT9JWEZZfG)

Step 4: Wait for Model Registration

curl --location 'http://localhost:9200/_plugins/_ml/tasks/bXkjSpsBv7PT9JWEZZfG'

Replace with your task_id. Wait until state: "COMPLETED".

πŸ“ Save the model_id from the response (e.g., cXkjSpsBv7PT9JWEbZeJ)

Step 5: Create Embedding Pipeline

curl --location --request PUT 'http://localhost:9200/_ingest/pipeline/note-embedding-pipeline' \
--header 'Content-Type: application/json' \
--data '{
  "description": "Generate embeddings for note content",
  "processors": [
    {
      "text_embedding": {
        "model_id": "cXkjSpsBv7PT9JWEbZeJ",
        "field_map": {
          "content": "content_embedding"
        }
      }
    }
  ]
}'

Replace cXkjSpsBv7PT9JWEbZeJ with your model_id.

Step 6: Create Notes Index

curl --location --request PUT 'http://localhost:9200/notes' \
--header 'Content-Type: application/json' \
--data '{
  "settings": {
    "index": {
      "number_of_shards": 1,
      "number_of_replicas": 0,
      "default_pipeline": "note-embedding-pipeline",
      "knn": true
    },
    "analysis": {
      "analyzer": {
        "default": {
          "type": "standard"
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "note_id": {"type": "integer"},
      "title": {"type": "text", "analyzer": "standard"},
      "content": {"type": "text", "analyzer": "standard"},
      "text_to_embed": {"type": "text"},
      "embedding": {
        "type": "knn_vector",
        "dimension": 384,
        "method": {
          "name": "hnsw",
          "space_type": "cosinesimil",
          "engine": "lucene",
          "parameters": {
            "ef_construction": 128,
            "m": 24
          }
        }
      },
      "category": {"type": "keyword"},
      "tags": {"type": "keyword"},
      "user_id": {"type": "integer"},
      "created_at": {"type": "date"},
      "color": {"type": "keyword"}
    }
  }
}'
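
To sanity-check the index and its default pipeline, you can insert one note by hand and let OpenSearch generate the embedding on ingest. A minimal sketch (assumes the requests package; the field values are purely illustrative — in normal use the Flask app indexes notes for you through its OpenSearch service):

import requests

doc = {
    "note_id": 1,
    "title": "Grocery list",
    "content": "Buy oat milk, coffee beans, and bread.",
    "category": "personal",
    "tags": ["shopping"],
    "user_id": 1,
    "created_at": "2024-01-01T10:00:00Z",
    "color": "yellow",
}

# The index's default_pipeline (note-embedding-pipeline) runs on ingest,
# so an embedding field is added to the stored document automatically.
resp = requests.put("http://localhost:9200/notes/_doc/1", json=doc)
resp.raise_for_status()
print(resp.json()["result"])  # "created" on the first run, "updated" afterwards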

Step 7: Register Semantic Highlighter Model

curl --location 'http://localhost:9200/_plugins/_ml/models/_register?deploy=true' \
--header 'Content-Type: application/json' \
--data '{
  "name": "amazon/sentence-highlighting/opensearch-semantic-highlighter-v1",
  "version": "1.0.0",
  "model_format": "TORCH_SCRIPT",
  "function_name": "QUESTION_ANSWERING"
}'

πŸ“ Save the task_id from the response (e.g., DRe_XJsBQ3Oda9xdkRE4)

Step 8: Wait for Highlighter Model Deployment

curl --location 'http://localhost:9200/_plugins/_ml/tasks/DRe_XJsBQ3Oda9xdkRE4'

Replace DRe_XJsBQ3Oda9xdkRE4 with your task_id. Wait until state: "COMPLETED".

πŸ“ Save the model_id from the response (e.g., Dhe_XJsBQ3Oda9xdmBFJ)

Step 9: Create Hybrid RRF Pipeline

curl --location --request PUT 'http://localhost:9200/_search/pipeline/hybrid-rrf-pipeline' \
--header 'Content-Type: application/json' \
--data '{
  "description": "Post processor for hybrid RRF search",
  "phase_results_processors": [
    {
      "score-ranker-processor": {
        "combination": {
          "technique": "rrf"
        }
      }
    }
  ]
}'
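
With this pipeline in place, a hybrid query sends one BM25 clause and one neural (k-NN) clause and lets RRF fuse the two rankings. A minimal sketch (assumes the requests package; substitute the embedding model_id you saved in Step 4, and adjust the vector field name if your pipeline writes embeddings to a field other than the "embedding" field mapped in Step 6):

import requests

EMBEDDING_MODEL_ID = "cXkjSpsBv7PT9JWEbZeJ"  # your model_id from Step 4
query_text = "meeting notes about the search project"

body = {
    "query": {
        "hybrid": {
            "queries": [
                {"match": {"content": query_text}},        # lexical (BM25)
                {"neural": {"embedding": {                  # semantic (k-NN)
                    "query_text": query_text,
                    "model_id": EMBEDDING_MODEL_ID,
                    "k": 10,
                }}},
            ]
        }
    },
    "size": 5,
}

resp = requests.post(
    "http://localhost:9200/notes/_search",
    params={"search_pipeline": "hybrid-rrf-pipeline"},
    json=body,
)
for hit in resp.json()["hits"]["hits"]:
    print(hit["_source"].get("title"), hit["_score"])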

🤖 DeepSeek RAG Chatbot Configuration

Step 10: Create DeepSeek Connector

curl --location 'localhost:9200/_plugins/_ml/connectors/_create' \
--header 'Content-Type: application/json' \
--data '{
  "name": "DeepSeek Chat",
  "description": "Connector for DeepSeek Chat API",
  "version": "1",
  "protocol": "http",
  "parameters": {
    "endpoint": "api.deepseek.com",
    "model": "deepseek-chat"
  },
  "credential": {
    "deepSeek_key": "sk-your-deepseek-api-key-here"
  },
  "actions": [
    {
      "action_type": "predict",
      "method": "POST",
      "url": "https://${parameters.endpoint}/v1/chat/completions",
      "headers": {
        "Content-Type": "application/json",
        "Authorization": "Bearer ${credential.deepSeek_key}"
      },
      "request_body": "{\"model\": \"${parameters.model}\", \"messages\": ${parameters.messages}}"
    }
  ]
}'

πŸ“ Save the connector_id from the response (e.g., 9ockcJsByj6d9U--59XV)

Step 11: Register DeepSeek Model

curl --location 'http://localhost:9200/_plugins/_ml/models/_register?deploy=true' \
--header 'Content-Type: application/json' \
--data '{
  "name": "DeepSeek Chat model",
  "function_name": "remote",
  "description": "DeepSeek Chat",
  "model_group": "aHkiSpsBv7PT9JWEQJcl",
  "connector_id": "9ockcJsByj6d9U--59XV"
}'

Replace aHkiSpsBv7PT9JWEQJcl with your model_group_id and 9ockcJsByj6d9U--59XV with your connector_id.

πŸ“ Save the task_id from the response

Step 12: Wait for Model Deployment

curl --location 'http://localhost:9200/_plugins/_ml/tasks/YOUR_TASK_ID'

Wait until state: "COMPLETED".

πŸ“ Save the model_id from the response (e.g., -ocscJsByj6d9U--oNXX)

Step 13: Create RAG Search Pipeline

curl --location --request PUT 'http://localhost:9200/_search/pipeline/my-conversation-search-pipeline-deepseek-chat' \
--header 'Content-Type: application/json' \
--data '{
  "response_processors": [
    {
      "retrieval_augmented_generation": {
        "tag": "Notes RAG Pipeline",
        "description": "RAG pipeline using DeepSeek Chat",
        "model_id": "-ocscJsByj6d9U--oNXX",
        "context_field_list": ["title", "content", "category", "tags"],
        "system_prompt": "You are a helpful assistant that helps users explore and understand their personal notes. Use the provided context from the user'\''s notes to answer questions accurately and helpfully. Reference specific notes by title when relevant. If the context doesn'\''t contain enough information, acknowledge this politely. Be conversational and helpful.",
        "user_instructions": "Answer based on these notes from my collection"
      }
    }
  ]
}'

Replace -ocscJsByj6d9U--oNXX with your DeepSeek model_id.

Step 14: Test the RAG Pipeline

curl --location 'http://localhost:9200/notes/_search?search_pipeline=my-conversation-search-pipeline-deepseek-chat' \
--header 'Content-Type: application/json' \
--data '{
  "query": {
    "match": {
      "content": "test"
    }
  },
  "ext": {
    "generative_qa_parameters": {
      "llm_question": "What notes do I have?",
      "llm_model": "deepseek-chat"
    }
  }
}'

✅ Configuration Complete! Proceed to the next section to build and run the application.

🎨 Build and Run

1. Build CSS Assets

# One-time build
npm run build

# Watch mode (auto-rebuild on changes)
npm run watch

2. Start the Application

python app.py

The application will be available at http://localhost:5000

3. Default Login

  • Username: ismail
  • Password: ismail

⚠️ Important: Change these credentials in production!

πŸ“ Project Structure

notebook/
├── app/
│   ├── __init__.py                 # Flask app factory
│   ├── models/                     # Database models
│   │   ├── user.py
│   │   ├── note.py
│   │   ├── category.py
│   │   ├── chat_session.py
│   │   └── chat_message.py
│   ├── routes/                     # Flask blueprints
│   │   ├── auth.py                 # Authentication
│   │   ├── main.py                 # Home/dashboard
│   │   ├── note.py                 # Note CRUD
│   │   └── chatbot.py              # Chatbot API
│   ├── services/                   # Business logic
│   │   ├── auth_service.py
│   │   ├── note_service.py
│   │   ├── category_service.py
│   │   ├── opensearch_service.py
│   │   ├── rag_service.py
│   │   ├── session_service.py
│   │   └── message_service.py
│   ├── static/                     # Static assets
│   │   ├── dist/                   # Compiled CSS
│   │   ├── src/                    # SCSS source
│   │   ├── js/                     # JavaScript
│   │   └── images/                 # Icons & images
│   ├── templates/                  # Jinja2 templates
│   │   ├── base.html
│   │   ├── index.html
│   │   ├── login.html
│   │   ├── register.html
│   │   └── chatbot_popup.html
│   └── utils/                      # Helper utilities
│       └── Note_search_result.py
├── instance/
│   └── notebook.db                 # SQLite database
├── app.py                          # Application entry point
├── config.py                       # Configuration
├── docker-compose.yml              # OpenSearch container
├── requirements.txt                # Python dependencies
├── package.json                    # Node.js dependencies
├── tailwind.config.js              # Tailwind configuration
└── .env                            # Environment variables

🔧 Configuration

Environment Variables

Edit .env to customize:

# Security
SECRET_KEY=your-secret-key

# Database
SQLALCHEMY_DATABASE_URI=sqlite:///notebook.db

# OpenSearch
OPENSEARCH_HOST=localhost
OPENSEARCH_PORT=9200

# DeepSeek
DEEPSEEK_API_KEY=sk-your-key
DEEPSEEK_MODEL=deepseek-chat

# RAG Settings
MAX_CONTEXT_NOTES=5
MAX_CONTEXT_LENGTH=2000
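
config.py reads these values at startup. As an illustration only (the real config.py in this repository may differ), a typical python-dotenv based loader looks like this, assuming the python-dotenv package is installed:

import os
from dotenv import load_dotenv

load_dotenv()  # pull variables from .env into the process environment

class Config:
    SECRET_KEY = os.environ.get("SECRET_KEY", "dev-only-secret")
    SQLALCHEMY_DATABASE_URI = os.environ.get("SQLALCHEMY_DATABASE_URI", "sqlite:///notebook.db")
    OPENSEARCH_HOST = os.environ.get("OPENSEARCH_HOST", "localhost")
    OPENSEARCH_PORT = int(os.environ.get("OPENSEARCH_PORT", 9200))
    DEEPSEEK_API_KEY = os.environ.get("DEEPSEEK_API_KEY", "")
    DEEPSEEK_MODEL = os.environ.get("DEEPSEEK_MODEL", "deepseek-chat")
    MAX_CONTEXT_NOTES = int(os.environ.get("MAX_CONTEXT_NOTES", 5))
    MAX_CONTEXT_LENGTH = int(os.environ.get("MAX_CONTEXT_LENGTH", 2000))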

🎯 Features Deep Dive

Hybrid Search

Combines two retrieval methods and fuses their results:

  1. Lexical Search: Traditional keyword matching using BM25
  2. Semantic Search: Vector similarity using sentence embeddings
  3. RRF Fusion: Reciprocal Rank Fusion merges the two rankings (see the toy sketch below)
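
A toy illustration of the RRF idea (not the OpenSearch implementation): every document scores 1 / (k + rank) in each result list it appears in, and the summed scores decide the fused order.

def rrf_fuse(result_lists, k=60):
    """Fuse several ranked lists of document IDs with Reciprocal Rank Fusion."""
    scores = {}
    for results in result_lists:
        for rank, doc_id in enumerate(results, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

lexical = ["note-3", "note-7", "note-1"]    # BM25 order
semantic = ["note-7", "note-2", "note-3"]   # k-NN order
print(rrf_fuse([lexical, semantic]))        # notes ranked well in both lists come first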

RAG Chatbot

The chatbot uses Retrieval-Augmented Generation:

  1. Retrieval: Searches your notes using hybrid search
  2. Augmentation: Adds relevant notes as context
  3. Generation: DeepSeek Chat generates responses based on your notes (one round trip is sketched below)
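
In OpenSearch terms, all three stages happen in a single search call against the RAG pipeline from Step 13. A minimal sketch (assumes the requests package; the pipeline and model come from Steps 11-13):

import requests

question = "What notes do I have about the search project?"

body = {
    "query": {"match": {"content": question}},  # retrieval: find candidate notes
    "size": 5,
    "ext": {
        "generative_qa_parameters": {           # augmentation + generation settings
            "llm_question": question,
            "llm_model": "deepseek-chat",
        }
    },
}

resp = requests.post(
    "http://localhost:9200/notes/_search",
    params={"search_pipeline": "my-conversation-search-pipeline-deepseek-chat"},
    json=body,
)
# The RAG response processor typically returns its answer under ext.retrieval_augmented_generation.
print(resp.json()["ext"]["retrieval_augmented_generation"]["answer"])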

Benefits:

  • Answers are grounded in your actual notes
  • Cites specific notes when relevant
  • Maintains conversation history
  • Understands natural language queries

πŸ› Troubleshooting

OpenSearch Won't Start

# Check container status
docker ps -a

# View logs
docker-compose logs -f opensearch

# Restart
docker-compose restart opensearch

Model Registration Fails

  • Ensure sufficient memory (512MB minimum)
  • Check Java heap settings in docker-compose.yml
  • Verify the ML Commons plugin is active (a quick probe is sketched below)
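
A quick probe from Python (assumes the requests package): confirm the ML plugin is installed and list the registered models and their states.

import requests

plugins = requests.get("http://localhost:9200/_cat/plugins?format=json").json()
print(sorted({p["component"] for p in plugins}))  # expect "opensearch-ml" in this list

models = requests.post(
    "http://localhost:9200/_plugins/_ml/models/_search",
    json={"query": {"match_all": {}}, "size": 20},
).json()
for hit in models.get("hits", {}).get("hits", []):
    print(hit["_source"].get("name"), hit["_source"].get("model_state"))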

Chatbot Not Working

  • Verify DeepSeek API key is valid
  • Check RAG pipeline is created correctly
  • Test the DeepSeek connector/model on its own first (see the sketch below)
  • Review application logs for errors
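
To test the DeepSeek connection on its own, call the remote model's predict API directly. A sketch (assumes the requests package; use the DeepSeek model_id you saved in Step 12):

import requests

DEEPSEEK_MODEL_ID = "-ocscJsByj6d9U--oNXX"  # your model_id from Step 12

resp = requests.post(
    f"http://localhost:9200/_plugins/_ml/models/{DEEPSEEK_MODEL_ID}/_predict",
    json={"parameters": {"messages": [{"role": "user", "content": "Say hello"}]}},
)
print(resp.status_code)
print(resp.json())  # a DeepSeek chat completion should appear in the inference results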

CSS Not Loading

# Rebuild CSS
npm run build

# Check for errors
npm run watch

Database Issues

# Reset database (deletes all data!)
rm instance/notebook.db
python app.py

📊 API Endpoints

Notes API

  • GET / - Dashboard with all notes
  • POST /notes/create - Create new note
  • PUT /notes/<id>/edit - Edit note
  • DELETE /notes/<id>/delete - Soft delete note
  • POST /notes/search - Search notes

Chatbot API

  • GET /api/chatbot/health - Health check
  • POST /api/chatbot/chat - Send message to chatbot (example request below)
  • GET /api/chatbot/sessions - List chat sessions
  • POST /api/chatbot/sessions - Create new session
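
For example, a chat request from Python might look like the sketch below. The payload field names (message, session_id) and the login form field names are assumptions for illustration only — check app/routes/chatbot.py and app/routes/auth.py for the actual schemas. The endpoint requires an authenticated session.

import requests

session = requests.Session()

# Log in first so the session cookie is set (default dev credentials shown above).
# The form field names here are assumed, not confirmed.
session.post("http://localhost:5000/auth/login", data={"username": "ismail", "password": "ismail"})

resp = session.post(
    "http://localhost:5000/api/chatbot/chat",
    json={"message": "What notes do I have about groceries?", "session_id": 1},  # illustrative fields
)
print(resp.status_code, resp.json())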

Authentication API

  • POST /auth/register - User registration
  • POST /auth/login - User login
  • GET /auth/logout - User logout

🚀 Deployment

Production Checklist

  • Change default user credentials
  • Generate a strong SECRET_KEY (one way is shown below)
  • Use PostgreSQL instead of SQLite
  • Disable debug mode and serve with a production WSGI server (Flask 3 ignores FLASK_ENV)
  • Enable HTTPS
  • Configure proper OpenSearch cluster
  • Set up backup for database
  • Monitor API usage (DeepSeek)
  • Configure rate limiting
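
One way to generate a strong SECRET_KEY for the .env file:

import secrets

# 64 hex characters (~256 bits of randomness) is plenty for Flask session signing
print(secrets.token_hex(32))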

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch: git checkout -b feature-name
  3. Commit changes: git commit -am 'Add feature'
  4. Push to branch: git push origin feature-name
  5. Submit a pull request

πŸ“ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • OpenSearch - Powerful search and analytics engine
  • DeepSeek - Advanced language model API
  • Flask - Lightweight web framework
  • Sentence Transformers - State-of-the-art text embeddings
  • Tailwind CSS - Modern utility-first CSS framework
