A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
-
Updated
Feb 14, 2026 - Python
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
Open source document processing pipeline for the Epstein case files. Download OCR, extract entities, deduplicate and export documents from the DOJ Releases
FULL_EPSTEIN_INDEX is a comprehensive, unified research archive aggregating public releases related to the Jeffrey Epstein estate and associated investigations.
Play Bad Apple and DOOM on redacted Epstein files and other documents. Implemented using KNN and feature vector.
Download all Epstein files, images, pdfs, and more!
Obsidian plugin to hide text using strikethrough-style masking
This repository serves as a comprehensive directory and organizational hub for Epstein-related files and documentation with a full web interface, automated workflows, and AI agent infrastructure.
downloads .pdf files from DOJ website / epstein data-sets
Download all of the Jeffrey Epstein court records with this Python script! Mirror of https://git.graveyard.sh/OfficialB/doj-epstein-crawler
Epstein email archive explorer — scraper, SQLite FTS5 indexer, FastAPI + D3.js web UI
A data-driven audit of the 'Geopolitical Thermostat,' documenting how timed information disclosure regulates public attention to enable structural shifts in policy and capital flows.
Public disclosures for The Loose Thread Project: Tiered dossiers on The Syndicate, Operators, Deep Web, Institutional Penetration, Exponential Expansion, and more. ZIP archive + individual DOCX files.
Interactive D3.js visualization of the Epstein Files Transparency Act (EFTA) documents. Features a document similarity network using TF-IDF cosine similarity with community detection, and an entity relationship network mapping people and organizations. Explore connections through force-directed layouts with filtering, search, and entity overlays.
Ask questions about the Epstein Files using AI - A RAG pipeline with hybrid search, re-ranking, and Streamlit UI built on the Epstein Files 20K dataset.
it's just a forked repo
Download documents from the Epstein Files 2026. CVS Direct Downloader or via web.
A reserch to the famous name on the epstein files library
discord bot that permit users search to epstein files
Add a description, image, and links to the epstein-files topic page so that developers can more easily learn about it.
To associate your repository with the epstein-files topic, visit your repo's landing page and select "manage topics."