AI-powered analysis of investment documents (CIMs, investment memos) using NLP and LLMs.
Investment professionals spend 20+ hours manually reviewing 100-300 page deal documents, creating bottlenecks in deal screening.
Automated system that extracts financial metrics, flags risks, and generates AI summaries.
Result: 20+ hours → 5 minutes (99.6% time reduction)
- Frontend: Streamlit
- LLMs: OpenAI GPT-4
- NLP: spaCy, FinBERT
- PDF Processing: pdfplumber
- Database: DuckDB
git clone https://github.com/leaalonzo/financial-doc-analyzer.git
cd financial-doc-analyzer
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python -m spacy download en_core_web_sm
export OPENAI_API_KEY=sk-your-key-herecp .env.example .envOPENAI_API_KEY="sk-your-actual-key-here"streamlit run app.pyUpload a PDF and get analysis in ~1 minute.
- AI-Generated Executive Summary (GPT-4)
- Financial metric extraction
- Risk detection (15+ categories)
- Sentiment analysis (FinBERT)
- Entity recognition
- 20+ hours → 5 minutes per document
- 10x deal throughput increase
- Standardized analysis framework
MIT
Disclaimer: Educational project. Not financial advice.