Browser MCP Server

A lightweight Model Context Protocol (MCP) server that gives AI agents controlled, real-browser automation through Chrome/Chromium via Chrome DevTools Protocol (CDP).

It is designed to be predictable and safe: allowlisted hosts, bounded timeouts, stable tooling, and one shared session for multi-step runs.

What you get

Drive a real browser: click, type, scroll, drag, screenshot, and navigate.
Make long sequences reliable with run(...) and flow(...) (single call, bounded, low noise).
Extension mode for your existing Chrome profile (no restart, no debug port).

Quick start (golden path)

One-time setup:

./tools/setup

Diagnose environment:

./tools/doctor

Verify the repo is healthy:

./tools/gate

Run the server (extension mode is default):

./scripts/run_browser_mcp.sh

Choose a mode

Recommended: Extension mode (no restart).

Drives your existing Chrome profile with your tabs/cookies/extensions.
No CDP port needed; fewer connection headaches.

./scripts/run_browser_mcp.sh

Attach to a CDP port (classic).

./scripts/start_user_browser_cdp.sh
MCP_BROWSER_MODE=attach ./scripts/run_browser_mcp.sh

Launch a dedicated Chromium (clean profile).

python -m pip install -r requirements.txt
./scripts/install_local_chromium.sh
MCP_BROWSER_MODE=launch ./scripts/run_browser_mcp.sh

Configuration

Environment variables:

MCP_BROWSER_BINARY — path to Chrome/Chromium binary. If unset, the server auto-detects in this order:
1. Local Chromium: vendor/chromium/chrome (portable, installed via install_local_chromium.sh)
2. System Chromium: /usr/bin/chromium, /usr/bin/chromium-browser, etc.
3. System Chrome: /usr/bin/google-chrome, /usr/bin/google-chrome-stable, etc.
4. Snap Chromium (last resort - has known issues)
MCP_BROWSER_MODE — lifecycle mode: extension (recommended), attach, or launch.
MCP_BROWSER_PROFILE — user-data-dir; default ~/.gemini/browser-profile.
MCP_BROWSER_PORT — remote debugging port; default 9222.
MCP_BROWSER_FLAGS — extra flags appended to Chrome launch.
MCP_EXTENSION_RPC_TIMEOUT — extension-mode RPC/CDP timeout seconds (default 8).
MCP_EXTENSION_CONNECT_TIMEOUT — wait-for-extension connect timeout seconds (default 4).
MCP_NATIVE_HOST_AUTO_INSTALL — auto-install Native Messaging host on startup (default 1; set 0 to disable).
MCP_EXTENSION_AUTO_LAUNCH — auto-launch managed Chrome with the extension if no broker is found (default 0; opt-in).
MCP_EXTENSION_PROFILE — user-data-dir for the managed extension Chrome profile (default ~/.gemini/browser-extension-profile).
MCP_EXTENSION_IDS — comma-separated extension IDs to allow in the native host manifest (optional).
MCP_AUTO_PORT_FALLBACK — if set to 1, allows switching to a free port + an owned profile when the configured port is busy/unresponsive (default: 0).
MCP_ALLOW_HOSTS — comma-separated allowlist (e.g., example.com,github.com). Empty or * disables host filtering.
MCP_HTTP_TIMEOUT — request timeout seconds (default 10).
MCP_HTTP_MAX_BYTES — maximum bytes to return from HTTP responses (default 1_000_000).
MCP_PERMISSION_POLICY — JSON policy for per-origin permissions (optional; see docs/RUN_GUIDE.md).
MCP_PERMISSION_ALLOW — semicolon rules: origin=perm1,perm2;origin2=perm3 (optional).
MCP_PERMISSION_DENY — semicolon rules: origin=perm1,perm2;origin2=perm3 (optional).
MCP_PERMISSION_DEFAULT — default setting prompt|deny|allow (default prompt).
MCP_PERMISSION_DEFAULT_PERMS — comma permissions used with MCP_PERMISSION_DEFAULT.
MCP_HEADLESS — set to 1 for headless mode, 0 for visible window (default: 1).
MCP_WINDOW_SIZE — initial window size in visible mode, format width,height (default: 1280,900).

Available tools

This server exports a small set of unified tools. The canonical source of truth is tools/list (and the generated snapshot in contracts/).

Tool	What it does
`page`	Analyze page structure/content; diagnostics/resources/perf/locators
`extract_content`	Structured content extraction with pagination
`flow`	Batch multiple steps into one call (single compact summary + optional screenshot)
`run`	OAVR runner (Observe → Act → Verify → Report); uses `flow` under the hood
`app`	High-level macros/adapters for complex apps (e.g. `app(op='diagram')`, `app(op='insert')`)
`navigate`	Navigate/back/forward/reload (unified)
`click`	Click by text/selector/coordinates
`type`	Type text, type into selector, or press key
`scroll`	Scroll directions or to element/top/bottom
`form`	Fill/select/focus/clear/wait-for-element
`screenshot`	Screenshot page or element
`tabs`	List/switch/new/close tabs
`cookies`	Get/set/delete cookies
`captcha`	Detect/interact with common CAPTCHA flows
`mouse`	Move/hover/drag low-level
`resize`	Resize viewport/window
`js`	Evaluate JS in the page
`http`	Safe HTTP GET outside the browser (allowlist enforced)
`fetch`	Fetch from page context (cookies/session; subject to CORS)
`upload`	Upload file(s) to file input
`dialog`	Handle alert/confirm/prompt
`totp`	Generate TOTP codes (2FA helper)
`wait`	Wait for navigation/load/element/text
`browser`	Launch/status; DOM/element helpers

Docs and guides

docs/RUN_GUIDE.md — minimal-call run/flow examples
docs/AGENT_PLAYBOOK.md — patterns for low-noise automation
docs/MACROS.md — macro catalog for run(...)
docs/RUNBOOKS.md — recording and replaying step lists
docs/RELEASE_NOTES.md — recent changes
TROUBLESHOOTING.md — common fixes

Safety notes

Set MCP_ALLOW_HOSTS to the minimal set you need; otherwise the server will allow all hosts.
Ensure the CDP port (MCP_BROWSER_PORT) is free before launching to avoid hijacking an existing browser session.
Headless runs reuse the same profile; isolate with dedicated profiles if you need stricter separation.

Architecture

┌──────────────────┐      ┌─────────────────┐      ┌─────────────────┐
│   AI Agent       │─────▶│   MCP Server    │─────▶│   Chrome        │
│   (Claude, etc)  │ MCP  │   (Python)      │ CDP  │   Browser       │
└──────────────────┘      └─────────────────┘      └─────────────────┘

The server uses Chrome DevTools Protocol (CDP) directly via WebSocket for all browser automation, including cookie management and in-page fetch requests.

Testing

Recommended:

./tools/gate

Focused runs:

pytest -q --maxfail=1 --cov=mcp_servers --cov-report=term-missing

Live integration (real sites):

RUN_BROWSER_INTEGRATION=1 pytest -q tests/test_real_sites_smoke.py

Strict live allowlist (fail on low pass-rate):

RUN_BROWSER_INTEGRATION=1 RUN_BROWSER_INTEGRATION_EDGE=1 \
RUN_BROWSER_INTEGRATION_LIVE_STRICT=1 \
RUN_BROWSER_INTEGRATION_LIVE_ALLOWLIST=content_root_debug,table_index,container_news \
RUN_BROWSER_INTEGRATION_LIVE_MIN_PASS=1.0 \
pytest -q tests/test_real_sites_smoke.py

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.serena		.serena
.tasks/desktop/devtools		.tasks/desktop/devtools
ai		ai
contracts		contracts
docs		docs
mcp_servers		mcp_servers
scripts		scripts
tests		tests
tools		tools
vendor		vendor
.apply_task_projects.yaml		.apply_task_projects.yaml
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
GOALS.md		GOALS.md
LEGEND.md		LEGEND.md
MAP.md		MAP.md
PHILOSOPHY.md		PHILOSOPHY.md
README.md		README.md
REPO_RULES.md		REPO_RULES.md
TROUBLESHOOTING.md		TROUBLESHOOTING.md
VISIBLE_MODE.md		VISIBLE_MODE.md
mcp_config.json		mcp_config.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Browser MCP Server

What you get

Quick start (golden path)

Choose a mode

Configuration

Available tools

Docs and guides

Safety notes

Architecture

Testing

About

Uh oh!

Releases 8

Packages

Contributors 2

Uh oh!

Languages

AmirTlinov/browser

Folders and files

Latest commit

History

Repository files navigation

Browser MCP Server

What you get

Quick start (golden path)

Choose a mode

Configuration

Available tools

Docs and guides

Safety notes

Architecture

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 2

Uh oh!

Languages

Packages