This project monitors the FedRAMP Rev5 documentation websites for changes, providing detailed tracking of documentation updates and public notices.
If you are an AI agent (Claude, Cursor, etc.) working on this project:
-
READ THESE FILES IMMEDIATELY BEFORE DOING ANYTHING:
CLAUDE.md- Mandatory agent instructions and workflowspec.md- Complete operational specificationsnapshots/latest.json- Current baseline pointer
-
CRITICAL: When user asks "What has changed?"
- You MUST do a complete fresh crawl of BOTH websites
- You MUST create a NEW snapshot with current timestamp
- You MUST update
snapshots/latest.json - See CLAUDE.md for step-by-step requirements
This tool monitors TWO FedRAMP websites:
- https://www.fedramp.gov/docs/rev5/ (Rev5 Documentation - 40-50+ pages)
- https://www.fedramp.gov/notices/ (Public Notices - single page)
The specification file (spec.md) provides:
- Complete monitoring workflow and requirements
- Snapshot creation process
- Change detection methodology
- Site navigation and structure
- Document categories and types
- URL patterns for different resources
- CSS selectors for web scraping
- Best practices for responsible querying
.
βββ README.md # This file
βββ CLAUDE.md # AI agent instructions (READ FIRST)
βββ .cursorrules # Cursor-specific rules
βββ spec.md # Complete specification and workflow
βββ snapshots/ # Snapshot storage
βββ latest.json # Pointer to current baseline
βββ YYYY-MM-DDTHHMMSSZ-*/ # Timestamped snapshots
- CLAUDE.md - Mandatory instructions for AI agents. Read this first in every new session.
- spec.md - Complete operational specification with detailed monitoring workflow.
- snapshots/latest.json - Points to the most recent snapshot (baseline for comparisons).
- snapshots/[timestamp]-update/ - Timestamped snapshot directories containing fetched website content.
The spec organizes documents by audience:
For Cloud Service Providers:
- Getting Started:
/playbook/csp/authorization/overview/ - System Security Plan:
/playbook/csp/authorization/ssp/ - Continuous Monitoring:
/playbook/csp/continuous-monitoring/
For Agencies:
- Marketplace Info:
/playbook/agency/marketplace/overview/ - Sponsorship:
/playbook/agency/sponsorship/ - Authorization:
/playbook/agency/authorization/
Using the URL pattern from the spec:
Pattern: /playbook/{audience}/{process}/{topic}/
Examples:
- https://www.fedramp.gov/playbook/csp/authorization/ssp/
- https://www.fedramp.gov/playbook/agency/marketplace/overview/
- https://www.fedramp.gov/playbook/csp/continuous-monitoring/poam/
The spec identifies these key document types:
- Playbooks - Comprehensive guides (HTML/PDF)
- SSP - System Security Plan templates
- SAP - Security Assessment Plan templates
- SAR - Security Assessment Report templates
- POA&M - Plans of Action & Milestones
From the spec, use these selectors:
/* Navigation menu */
nav
/* Main content area */
main, article, .content
/* Document links */
a[href*='/playbook/']
/* PDF downloads */
a[href$='.pdf']
/* Section headings */
h1, h2, h3Using the spec to build a simple query:
-
Identify your target audience:
- Cloud Service Providers β
/playbook/csp/ - Agencies β
/playbook/agency/
- Cloud Service Providers β
-
Select the process:
- Authorization
- Continuous Monitoring
- Marketplace
- Sponsorship
-
Choose the topic:
- SSP, SAP, SAR, POA&M
- Overview, procedures, etc.
-
Construct the full URL:
https://www.fedramp.gov/playbook/{audience}/{process}/{topic}/
Based on extraction targets in the spec:
Get all document titles:
- Selector:
h1, h2 - Extract: text content
Get all document links:
- Selector:
a[href$='.pdf'], a[href*='/playbook/'] - Extract:
hrefattribute
Get navigation structure:
- Selector:
nav a, .nav-link - Extract:
hrefand text content
Requests per second: 1
Delay between requests: 1000msAlways identify your scraper:
User-Agent: FedRAMP-Documentation-Bot/1.0 (Research Purpose)
Enable caching to minimize redundant requests:
- Cache duration: 15 minutes
- Reduces server load
Check before scraping: https://www.fedramp.gov/robots.txt
The FedRAMP Rev5 documentation is organized into these main sections:
- General Info - Changelog, logos, foundational documents
- Authority & Responsibility - Legal framework and guidance
- FedRAMP 20x - Next-generation process materials
- FedRAMP Rev5 - Current revision playbooks and guides
- Getting Started guides
- Authorization Package documentation
- Continuous Monitoring resources
- Marketplace information
- Liaison program details
- Sponsorship procedures
- Authorization processes
Optional and mandatory enhancements bringing FedRAMP 20x features to Rev5
The spec supports these output formats:
- JSON
- CSV
- Markdown
Recommended fields to extract:
- title
- url
- category
- document_type
- audience
- last_modified
- content_summary
Start with CSP or Agency section, then drill down by process and topic.
Search for specific document types (SSP, SAP, SAR, POA&M).
Build URLs using the pattern and known values.
- The site uses tab-based navigation with local storage for user preferences
- Balance Improvement Releases bridge FedRAMP 20x and Rev5 processes
- Content is primarily organized by audience (CSP vs Agency)
- Playbooks are available in both HTML and PDF formats
- Site structure may change; verify selectors periodically
When you ask "What has changed?" or "Check for changes":
- The agent checks the current date/time
- Loads the baseline from
snapshots/latest.json - Crawls BOTH websites fresh from fedramp.gov
- Creates a new timestamped snapshot
- Compares new vs baseline snapshot
- Reports changes detected
- Updates
latest.jsonto point to new snapshot
Read CLAUDE.md first. It contains the mandatory workflow you must follow.
Critical requirements:
- Always check current date with
date -ubefore doing anything - Always create a NEW snapshot when checking for changes
- Always fetch fresh content from websites (don't just compare old snapshots)
- Always update
snapshots/latest.jsonafter creating snapshot - Always report temporal context ("Changes since [date] ([X days] ago)")
-
Rev5 Documentation (40-50+ pages)
- Base: https://www.fedramp.gov/docs/rev5/
- All pages under
/docs/rev5/path
-
Public Notices (single page)
- Created: 2026-03-23
- Updated: 2026-04-10
- Target sites:
- Format version: Rev5
This specification is provided for research and documentation purposes.