ML-Powered Research Assistant

The ML-Powered Research Assistant is a web application designed to assist researchers by analyzing indiviual and multiple PDF research documents. It provides individual summaries, a comparative final summary, sentiment analysis, keyword extraction, and a RAG-powered chatbot to answer questions about the documents. The backend is built with FastAPI and leverages machine learning models for natural language processing, while the frontend is developed using Next.js with TypeScript and Tailwind CSS for a modern, responsive UI.

Features

Upload Multiple PDFs: Upload multiple research documents (PDFs) for analysis.
Individual Summaries: Generate concise summaries for each uploaded document.
Final Summary: Compare key themes and findings across all documents in a comparative summary.
Sentiment Analysis: Analyze the sentiment of the summaries using a pre-trained DistilBERT model.
Keyword Extraction: Extract relevant keywords from each document using spaCy.
RAG-Powered Chatbot: Chat with a Retrieval-Augmented Generation (RAG) chatbot to ask questions about the documents, powered by LLMs and embeddings.

Tech Stack

Backend

Framework: FastAPI (Python)
LLM: Ollama (LLaMA 3.3 for text generation, Nomic Embed for embeddings)
NLP Libraries:
- llama-index: For document indexing, summarization, and querying.
- spaCy: For keyword extraction.
- transformers: For sentiment analysis using DistilBERT.
PDF Processing: PyPDF2
Environment Management: python-dotenv for environment variables

Frontend

Framework: Next.js (React with TypeScript)
Styling: Tailwind CSS
Components: Custom React components (chat-interface.tsx, document-uploader.tsx, summary-panel.tsx, theme-provider.tsx)
State Management: React hooks
Build Tools: TypeScript, PostCSS, ESLint

Prerequisites

Python: 3.8+ (for the backend)
Node.js: 18+ (for the frontend)
Git: For cloning the repository
Ollama Server: Access to an Ollama server for LLM and embedding models (or a local setup)

Setup Instructions

1. Clone the Repository

git clone https://github.com/abm1499/ML-Powered-Research-Assistant.git
cd ML-Powered-Research-Assistant

2. Backend Setup

Navigate to the backend directory:

cd backend

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

If requirements.txt is missing, install the required packages manually:

pip install fastapi uvicorn llama-index langchain-community pypdf2 requests python-dotenv spacy transformers torch
python -m spacy download en_core_web_sm

Create a .env file in the backend directory with the following:

LLM_API_URL=<your-ollama-llm-api-url>
EMBEDDING_API_URL=<your-ollama-embedding-api-url>

Replace and with the URLs of your Ollama server (e.g., http://localhost:11434 if running locally).

Run the backend server:

uvicorn main:app --host 0.0.0.0 --port 8000 --reload

2. Frontend Setup

Navigate to the frontend directory:

cd frontend

Install dependencies:

npm install

Run the frontend development server:

npm run dev

Usage

1. Open the frontend in your browser (http://localhost:3000).

2. Use the document uploader to upload one or more PDF research documents.

3. View the analysis results:

Summaries: Individual summaries for each document.

Final Summary: A comparative summary highlighting key themes and differences.

Sentiment Analysis: Sentiment of the summaries (positive, negative, or neutral).

Keywords: Extracted keywords from each document.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
1.png		1.png
2.png		2.png
3.png		3.png
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML-Powered Research Assistant

Features

Tech Stack

Backend

Frontend

Prerequisites

Setup Instructions

1. Clone the Repository

2. Backend Setup

2. Frontend Setup

Usage

1. Open the frontend in your browser (http://localhost:3000).

2. Use the document uploader to upload one or more PDF research documents.

3. View the analysis results:

4. Use the chat interface to ask questions about the documents (e.g., "What are the main findings?").

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ML-Powered Research Assistant

Features

Tech Stack

Backend

Frontend

Prerequisites

Setup Instructions

1. Clone the Repository

2. Backend Setup

2. Frontend Setup

Usage

1. Open the frontend in your browser (http://localhost:3000).

2. Use the document uploader to upload one or more PDF research documents.

3. View the analysis results:

4. Use the chat interface to ask questions about the documents (e.g., "What are the main findings?").

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages