
Agent-Tools/awesome-autonomous-web


🤖🛜 Awesome AI Agent Tools for the Web

A curated list of tools that let AI agents interact with the web: cloud browsers, authentication, automation frameworks, enterprise packages, benchmarks, and search.


💻 Browsers for AI

Cloud-hosted and open-source browsers designed for AI agents.

  • Anchor Browser - Cloud browser built for AI agents. Handles sessions, CAPTCHAs, MFA, and proxies automatically. Free tier available.
  • Steel - Open-source headless browser API for AI agents. Self-hostable or cloud. 6k+ GitHub stars. (GitHub)
  • Browserbase - Cloud browser platform with session replay, stealth mode, and Playwright/Puppeteer support. Paid.
  • Browserless - Scalable headless Chrome in the cloud. Paid.
  • Apify - Browser automation and web scraping platform. Paid.

🔌 MCP Servers for Browser Automation

Model Context Protocol servers that give LLMs (Claude, GPT-4, etc.) direct browser control.

  • Playwright MCP - Official Playwright MCP server. Uses accessibility snapshots (not screenshots) for fast, reliable LLM-driven automation. Open-source.
  • Steel MCP Server - Puppeteer-based MCP server backed by Steel cloud or local Docker. Open-source.
  • Anchor Browser MCP - MCP server for Anchor Browser cloud sessions. Handles auth, CAPTCHAs, and concurrent sessions.
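
As a rough illustration of how one of these servers gets wired into a client, the sketch below builds the kind of JSON config many MCP clients accept. It assumes the client reads an `mcpServers` map and that the Playwright MCP server is launched with `npx @playwright/mcp@latest`. Neither detail comes from this list, so check the server's own README before relying on it. The helper name `build_mcp_config` is hypothetical.

```python
import json


def build_mcp_config(server_name="playwright", package="@playwright/mcp@latest"):
    """Return a client config dict that launches an MCP server through npx.

    Assumption: the MCP client expects a top-level "mcpServers" map whose
    entries carry a "command" and an "args" list.
    """
    return {
        "mcpServers": {
            server_name: {
                "command": "npx",
                "args": [package],
            }
        }
    }


config = build_mcp_config()
# Print the JSON so it can be pasted into the client's config file.
print(json.dumps(config, indent=2))
```

Other servers in this section would need their own command and arguments; only the overall shape is meant to carry over.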

🪪 Web Authentication

Tools that enable AI agents to access authenticated websites and sessions.

  • Anchor Browser - Built-in MFA handling, cookie/session persistence, and fingerprint management for agents.

👀 AI Web Copilots

Autonomous AI agents that navigate and complete tasks on the web.

  • Please (formerly MultiOn) - Consumer AI agent for web tasks (travel booking, forms, shopping). Rebranded from MultiOn in 2025.
  • Jace.AI - AI agent for complex multi-step web workflows. Paid.

AI Web Automation Frameworks

Tools and SDKs that power web automation with AI.

Open Source

  • Stagehand by Browserbase - AI-native browser automation SDK. Built on Playwright with act(), extract(), observe(), and agent() primitives. Works with Claude, GPT-4, and Gemini. (GitHub)
  • Browser-use - Python library connecting LLMs to browser actions. High-level task interface over Playwright.
  • Skyvern-AI - Visual LLM-driven browser agent. Understands pages through screenshots rather than the DOM.
  • Agent-E - Hierarchical AI agent for browser automation tasks.
  • LaVague - Natural language browser control using text language models.
  • Tarsier by Reworkd - Vision utilities for web agents (element tagging, OCR).
  • Notte - Framework for building web agents with planning, vision, and LLM reasoning.

Paid Platforms

  • Axiom.ai - No-code browser automation bots. Paid.
  • Browse.ai - Extract and monitor web data without code. Paid.
  • ZeroStep - AI-powered test automation using plain English assertions. Paid.

🏢 Enterprise System Automation Packages

Pre-built, open-source browser automation for enterprise platforms. Powered by Puppeteer and Anchor Browser cloud. Published as @browser-automation-hub/* npm packages.

| System | npm Package | Category |
| --- | --- | --- |
| Epic EHR | @browser-automation-hub/epic-ehr-browser-automation | Healthcare |
| Cerner Powerchart | @browser-automation-hub/cerner-powerchart-browser-automation | Healthcare |
| Workday HCM | @browser-automation-hub/workday-hcm-browser-automation | HR |
| ADP Workforce Now | @browser-automation-hub/adp-workforce-now-browser-automation | HR |
| Salesforce Classic | @browser-automation-hub/salesforce-classic-browser-automation | CRM |
| ServiceNow | @browser-automation-hub/servicenow-browser-automation | ITSM |
| SAP Concur | @browser-automation-hub/sap-concur-browser-automation | ERP |
| SAP Fiori | @browser-automation-hub/sap-fiori-browser-automation | ERP |
| Oracle EBS | @browser-automation-hub/oracle-ebs-browser-automation | ERP |
| PeopleSoft | @browser-automation-hub/peoplesoft-browser-automation | ERP |
| Veeva Vault | @browser-automation-hub/veeva-vault-browser-automation | Life Sciences |
| Veeva CRM | @browser-automation-hub/veeva-crm-browser-automation | Life Sciences |
| Yardi Voyager | @browser-automation-hub/yardi-voyager-browser-automation | Real Estate |
| Procore | @browser-automation-hub/procore-browser-automation | Construction |
| SuccessFactors | @browser-automation-hub/successfactors-browser-automation | HR |

→ View all 30 packages on GitHub · Postman Collections
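
The package names listed above appear to follow one pattern: the system name lowercased, with spaces turned into hyphens, under the `@browser-automation-hub` scope and with a `-browser-automation` suffix. The sketch below derives a name from a system label on that assumption. The helper `package_name` is hypothetical, and the pattern is inferred only from the 15 rows shown here, so packages not listed may not follow it.

```python
import re

SCOPE = "@browser-automation-hub"  # npm scope named in this section
SUFFIX = "browser-automation"      # suffix shared by every listed package


def package_name(system: str) -> str:
    """Guess the npm package name for a system label such as 'Epic EHR'.

    Assumption: the slug is the label lowercased, with each run of
    non-alphanumeric characters replaced by a single hyphen.
    """
    slug = re.sub(r"[^a-z0-9]+", "-", system.lower()).strip("-")
    return f"{SCOPE}/{slug}-{SUFFIX}"


for system in ("Epic EHR", "ADP Workforce Now", "ServiceNow"):
    # Print an install command for each derived package name.
    print(f"npm install {package_name(system)}")
```

Confirm that a derived name actually exists on npm before installing it.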


🔬 Web Agent Benchmarks

Datasets and leaderboards for evaluating web agents.

  • WebArena - Realistic web tasks across multiple sites. Open-source.
  • WebArena Leaderboard - Live rankings.
  • VisualWebArena - Multimodal extension of WebArena with image-based tasks. Open-source.
  • WebCanvas - Online web agent evaluation with live environments. Open-source.
  • WorkArena - Benchmark for enterprise web tasks (ServiceNow). Open-source.
  • Mind2Web - Cross-task, cross-website generalization benchmark. Open-source.
  • Bananalyzer by Reworkd - Web agent evaluation framework. Open-source.

🖼️ Visual Web Automation

Computer vision and multimodal approaches to web interaction.

  • Skyvern-AI - Visual LLM agent that reads pages as screenshots. Open-source.
  • VimGPT - GPT-4V driving a browser via keyboard shortcuts. Open-source.

🕷️ AI Web Crawlers

Intelligent crawlers for data extraction at scale.

  • FireCrawl - LLM-ready web crawling and scraping with Markdown output. Paid.
  • Crawlee - Web scraping and crawling library by Apify. Supports Playwright, Puppeteer, and Cheerio. Open-source.
  • Spider - Fast AI web crawler with agent mode. Paid.
  • Reworkd - AI-driven data extraction pipelines. Paid.

🔍 Web Search Tools

Search APIs for giving agents real-time web access.

  • Tavily - Search API built for AI agents. Returns clean, LLM-optimized results. Paid (free tier).
  • Exa - Neural search API for web content. Combines semantic and keyword search. Paid.
  • Serper.dev - Google Search API for AI agents. Paid.
  • Brave Search API - Independent search index with AI-ready API. Paid (free tier).

🔓 CAPTCHA Solving

Tools for AI agents to handle CAPTCHAs and bot detection.

  • Anchor Browser - Automatic CAPTCHA solving built into the cloud browser runtime.
  • Capsolver - CAPTCHA solving API and browser extension. Paid.
  • 2captcha - Human-powered CAPTCHA solving API. Paid.

🛠️ GraphQL Consumption Tools

  • GraphQL.Chat - Explore and query GraphQL APIs with natural language. Free, closed source.
  • GQLPT.dev - Generate GraphQL queries from plain text. Free, closed source.

Feel free to open a PR to add tools or update descriptions. This list is maintained by the community.
