📄 Document Summarizer App

Built with Streamlit, LangChain, Hugging Face Transformers, and PyTorch

📌 Project Overview

This app extracts text from PDF files and generates concise summaries with the LaMini-Flan-T5 model. Streamlit provides the interactive web interface, LangChain handles text chunking, and Hugging Face Transformers runs the summarization model.

🚀 Key Features

📄 Upload and preview PDF documents directly in-browser
✂️ Intelligent document chunking using LangChain
🤖 Summarization using LaMini-Flan-T5 from Hugging Face
⚡ Text is split into chunks before summarization so inputs stay within the model's token limit
🧠 Built-in PDF viewer for side-by-side comparison
🌐 Streamlit-powered UI for fast deployment

🧑‍💻 Tech Stack

Frontend: Streamlit
NLP: Hugging Face Transformers (pipeline API), LaMini-Flan-T5
Text Preprocessing: LangChain (RecursiveCharacterTextSplitter)
PDF Parsing: PyPDFLoader (LangChain Community)
Model Backend: PyTorch
Other Tools: Base64 encoding for PDF rendering
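The Base64 step can be illustrated with a short sketch. This is one plausible way to embed a PDF in the page, not necessarily how this app does it; the function name `pdf_bytes_to_iframe` and the `height` parameter are hypothetical.

```python
import base64


def pdf_bytes_to_iframe(pdf_bytes: bytes, height: int = 600) -> str:
    """Return an HTML iframe that embeds the PDF as a Base64 data URI.

    Hypothetical helper for illustration; the app's own code may differ.
    """
    # Base64 turns the raw PDF bytes into ASCII text that is safe inside HTML.
    encoded = base64.b64encode(pdf_bytes).decode("utf-8")
    return (
        f'<iframe src="data:application/pdf;base64,{encoded}" '
        f'width="100%" height="{height}" type="application/pdf"></iframe>'
    )
```

In a Streamlit script, the returned string could be rendered with `st.markdown(html, unsafe_allow_html=True)`.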

📂 How It Works

📤 Upload a .pdf file via the Streamlit interface
📚 Text is extracted and chunked using RecursiveCharacterTextSplitter
🤖 A Transformers summarization pipeline, backed by T5ForConditionalGeneration loaded with LaMini-Flan-T5, generates the summary
📄 The original PDF and the generated summary are displayed side by side
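The extraction, chunking, and summarization steps above can be sketched roughly as follows. This is a minimal sketch under several assumptions, not the app's actual code: the checkpoint name `MBZUAI/LaMini-Flan-T5-248M`, the chunk sizes, the `langchain_text_splitters` import path (it varies between LangChain releases), the per-chunk summarization strategy, and all three function names are illustrative. It also assumes a Transformers version that still ships the `summarization` pipeline task.

```python
from typing import Callable, List


def load_pdf_chunks(pdf_path: str, chunk_size: int = 1000, chunk_overlap: int = 100) -> List[str]:
    """Extract text from a PDF and split it into overlapping chunks."""
    # Lazy imports; the module paths are assumptions and depend on the
    # installed LangChain version.
    from langchain_community.document_loaders import PyPDFLoader
    from langchain_text_splitters import RecursiveCharacterTextSplitter

    pages = PyPDFLoader(pdf_path).load()  # one Document per page
    # chunk_size and chunk_overlap are measured in characters by default.
    splitter = RecursiveCharacterTextSplitter(
        chunk_size=chunk_size, chunk_overlap=chunk_overlap
    )
    return [doc.page_content for doc in splitter.split_documents(pages)]


def build_summarizer(checkpoint: str = "MBZUAI/LaMini-Flan-T5-248M") -> Callable[[str], str]:
    """Return a function that summarizes one piece of text."""
    from transformers import AutoTokenizer, T5ForConditionalGeneration, pipeline

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = T5ForConditionalGeneration.from_pretrained(checkpoint)
    summarizer = pipeline("summarization", model=model, tokenizer=tokenizer)

    def summarize(text: str) -> str:
        # truncation=True clips inputs that exceed the model's token limit.
        result = summarizer(text, max_length=150, truncation=True)
        return result[0]["summary_text"]

    return summarize


def summarize_chunks(chunks: List[str], summarize: Callable[[str], str]) -> str:
    """Summarize each non-empty chunk and join the partial summaries."""
    summaries = [summarize(chunk) for chunk in chunks if chunk.strip()]
    return "\n".join(summaries)
```

A call such as `summarize_chunks(load_pdf_chunks("report.pdf"), build_summarizer())` would tie the steps together, where `report.pdf` is a placeholder path. Summarizing chunk by chunk keeps every model input short; joining all chunks and summarizing once is simpler but risks truncation on long documents.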

🛠️ Installation & Setup

# Clone the repository
git clone https://github.com/yourusername/document-summarizer-app.git
cd document-summarizer-app

# Create a virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate  # or venv\Scripts\activate on Windows

# Install required packages
pip install -r requirements.txt

# Launch the app
streamlit run streamlit_app.py
