Cut LLM costs by up to 80% and unlock sub-millisecond responses with intelligent semantic caching. A drop-in OpenAI-compatible proxy written in Go.
-
Updated
Nov 24, 2025 - Go
Cut LLM costs by up to 80% and unlock sub-millisecond responses with intelligent semantic caching. A drop-in OpenAI-compatible proxy written in Go.
Web2LLM.txt – A fast, open-source website-to-LLM context file generator. Paste any https:// URL and instantly get a clean llm.txt file with token & cost estimation—ideal for RAG, prompt engineering, and AI training workflows.
EarningsAI Demo is a powerful tool that combines audio transcription, document processing, and AI-powered analysis to help users extract insights from earnings calls and financial documents. Built with Fireworks AI and MongoDB, it provides both a command-line interface and a web application for processing and querying financial data.
Prism (Personal Retrieval & Insight System for Multimedia). This project is currently in progress. Offline Al-powered RAG system that links text, image & audio knowledge - built with FastAPI and local LLMs.
AI driven Content Creation Automation.
Add a description, image, and links to the rag-ai topic page so that developers can more easily learn about it.
To associate your repository with the rag-ai topic, visit your repo's landing page and select "manage topics."