An AI-powered agent that automatically tests and validates tutorials using browser automation and intelligent decision-making.
- Browser Automation: Uses Playwright MCP Server for realistic browser interactions
- AI-Powered Testing: LangGraph ReAct agent with Claude 4 Sonnet through SAP's Generative AI Hub
- Tutorial Validation: Automatically executes tutorial steps and identifies issues
- LLM: Claude 4 Sonnet access through SAP's Generative AI Hub (Cloud SDK for AI - Python)
- Agent Framework: LangGraph with ReAct pattern
- Browser Automation: Playwright MCP Server
tutorial-system-agent/
βββ tutorial-ai-agent/
β βββ agent/
β β βββ tutorial_tester_agent.py # Main agent
β βββ data/
β βββ tutorials/ # Tutorial markdown files
β βββ output/ # JSON results
βββ requirements.txt # Python dependencies
βββ docker-compose.yml # Container orchestration
βββ Dockerfile # Container definition
- Load Tutorial: Reads markdown tutorial files
- Initialize Agent: Creates LangGraph ReAct agent with MCP tools
- Execute Steps: Agent follows tutorial instructions in browser
- Validate Results: Checks for completion and identifies issues
- Generate Report: Saves structured JSON logs with timestamps
Key settings in tutorial_tester_agent.py:
TUTORIAL_FILE: Tutorial to test (default: "ailaunchpad-orchestration.md")RECURSION_LIMIT: Max agent steps (default: 500)MODEL_ID: LLM model (default: "anthropic--claude-4-sonnet")