Skip to content

anikrish05/LLMs.txt-Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Automated llms.txt Generator

An automated llms.txt generator that crawls websites, extracts key pages and metadata, and generates spec-compliant llms.txt files.

Architecture

This is a monorepo containing two services:

Service Description Deployment
profound-frontend/ Next.js web app with dashboard and API Vercel
crawler-service/ Playwright-based crawler microservice Railway
┌─────────────────┐         ┌──────────────────┐
│                 │  HTTP   │                  │
│  Vercel         │────────▶│  Railway         │
│  (Frontend)     │         │  (Crawler)       │
│                 │         │                  │
└────────┬────────┘         └────────┬─────────┘
         │                           │
         │ Prisma                    │ Prisma
         ▼                           ▼
┌───────────────────────────────────────────────┐
│              Supabase (PostgreSQL)            │
└───────────────────────────────────────────────┘

Quick Start

1. Clone the repository

git clone https://github.com/anikrish05/ProfoundTakeHome.git
cd ProfoundTakeHome

2. Start the crawler service

cd crawler-service
npm install
npx playwright install chromium
cp .env.example .env  # Configure DATABASE_URL and API_KEY
npm run dev

3. Start the frontend

cd profound-frontend
npm install
cp .env.example .env  # Configure your environment variables
npx prisma generate
npm run dev

Open http://localhost:3000

Documentation

Features

  • Public Generation: Generate llms.txt for any website without signing up
  • Site Management: Add and manage multiple websites to monitor
  • Automated Crawling: Scheduled crawls detect changes and regenerate automatically
  • Bot Protection Bypass: Playwright-based crawler handles JavaScript-heavy and protected sites
  • Spec Compliant: Output follows the llmstxt.org standard

Tech Stack

  • Frontend: Next.js 15, React 19, Tailwind CSS, shadcn/ui
  • Database: PostgreSQL with Prisma ORM
  • Auth: Supabase Auth
  • Crawler: Playwright, Express.js, Cheerio
  • Deployment: Vercel + Railway

🎥 Demo

Click to watch a short walkthrough of the project:

LLMs.txt Generator Demo

Automatically generates a compliant llms.txt file from a given website in seconds.

Releases

No releases published

Packages

 
 
 

Contributors

Languages