Generic architecture for browser-bound MCP tools over Tailscale trust networks

## Summary

Today, browser-bound MCP integrations tend to fall into two extremes:

1. **Local-only browser bridges**  
   - browser extension / localhost-only companion
   - great for single-machine use
   - hard to reuse from other trusted devices

2. **Raw browser automation / DevTools exposure**  
   - powerful, but too low-level and too risky to expose broadly
   - poor security boundary if shared across a trust network

There is a useful middle ground that OpenClaw could support/document as a **generic architecture pattern**:

- keep the browser/session runtime local to one trusted host
- expose only high-level MCP tools to the rest of the trusted network
- use Tailscale (or equivalent private network) as the transport boundary

This would enable “browser capability as a private infrastructure service” without exposing raw browser control.

---

## Problem

For integrations that depend on:

- a real logged-in browser session
- cookies / localStorage / browser fingerprint
- extension-based site access
- site-specific flows (e.g. X, XiaoHongShu, LinkedIn, etc.)

`localhost` is often too limiting, but exposing raw CDP / DevTools / extension internals is not desirable.

What users often want instead is:

- one persistent **browser host**
- accessible from multiple trusted devices / agents
- over a private trust network such as **Tailscale**
- while only exposing constrained, auditable, high-level capabilities

Examples:

- `search_x_posts(query)`
- `read_x_post(url)`
- `search_xhs(keyword, filters)`
- `get_xhs_feed_detail(feed_id, xsec_token)`

Not:

- arbitrary JS eval
- full CDP control
- unrestricted browser navigation
- cookie export

---

## Proposed Architecture

### 1. Browser Host Runtime
A fixed machine runs:

- Chrome/Edge
- logged-in user profile
- browser extensions / content scripts as needed
- real session state (cookies, storage, fingerprint, etc.)

This host is the only place that directly touches the website runtime.

### 2. Local Bridge
A localhost-only bridge process handles communication between:

- browser extension / page runtime
- local service logic

Examples:

- `127.0.0.1` HTTP
- localhost WebSocket
- native messaging
- Unix socket / stdio bridge

This layer should **not** be directly exposed to the network.

### 3. Tailnet-facing Gateway
A separate gateway process exposes:

- MCP tools
- or a clean HTTP/JSON-RPC API that can be wrapped as MCP

This gateway is the only externally reachable surface, and only inside the trusted network (for example, Tailscale tailnet).

It exposes **high-level site capabilities**, not raw browser control.

### 4. Trust Network Transport
Use Tailscale (or similar private networking) so that:

- only trusted devices can reach the gateway
- access can be controlled with ACLs / tags / device identity
- the service remains private without public Internet exposure

---

## Why this pattern is valuable

### Better security boundary
Instead of exposing a raw browser control plane, only a constrained tool surface is shared.

### Better reuse
One logged-in browser host can serve multiple trusted agents/devices.

### Better maintainability
The implementation details of the extension/browser runtime can evolve without changing the external MCP contract.

### Better operational model
This turns fragile “local browser hacks” into a reusable private service model.

---

## Recommended design principles

### Expose only high-level tools
Good examples:

- `search_*`
- `read_*`
- `get_*_detail`
- explicitly approved `post_*` operations

Avoid exposing by default:

- arbitrary JS evaluation
- raw CDP methods
- unrestricted navigation
- cookie/session export

### Keep the bridge local-only
The browser-facing bridge should remain localhost-only.  
Only the gateway should be reachable over the trust network.

### Add app-layer auth on top of Tailscale
Tailscale is a strong network boundary, but the gateway should still support:

- bearer token / API token
- optional per-tool authorization
- audit logging
- rate limiting

### Support policy separation
Different deployments may want:

- read-only browsing/search
- read + publish
- single-user access
- shared trusted-team access

This should be reflected as policy/config, not hardcoded assumptions.

---

## Possible OpenClaw/Helm relevance

This may be useful as either:

1. **documentation / architecture guidance**
2. **an example deployment pattern**
3. **a future chart pattern for browser-bound private services**
4. **a generic “gateway + local bridge + browser runtime” deployment model**

For Helm users specifically, a documented pattern could help clarify:

- what should stay localhost-only
- what should be exposed to the tailnet
- how to structure auth and ACLs
- how to safely share browser-bound capabilities across trusted devices

---

## Example deployment shape

```text
Trusted devices / agents
        │
        ▼
   Tailscale tailnet
        │
        ▼
Tailnet-facing MCP/API Gateway
        │
   localhost only
        ▼
     Local Bridge
        │
        ▼
Chrome/Edge + extension + logged-in profile
        │
        ▼
Target websites
```

---

## Request

I think this architecture pattern is broadly useful beyond any single site integration.

Would it make sense to document or standardize a recommended pattern for:

- browser-bound MCP tools
- local browser/session runtimes
- tailnet/private-network exposure
- safe separation between browser control and shared capability APIs

Even a short architecture note in the Helm/OpenClaw docs would likely help users design these systems more safely.

---

## zh-TW summary

建議把「已登入瀏覽器 + 本機 bridge + 對 tailnet 暴露高階 MCP gateway」整理成一個通用架構模式，讓可信任網路中的多台裝置可安全共用 browser-bound 能力，而不是直接暴露 raw browser / CDP 控制面。


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generic architecture for browser-bound MCP tools over Tailscale trust networks #26

Summary

Problem

Proposed Architecture

1. Browser Host Runtime

2. Local Bridge

3. Tailnet-facing Gateway

4. Trust Network Transport

Why this pattern is valuable

Better security boundary

Better reuse

Better maintainability

Better operational model

Recommended design principles

Expose only high-level tools

Keep the bridge local-only

Add app-layer auth on top of Tailscale

Support policy separation

Possible OpenClaw/Helm relevance

Example deployment shape

Request

zh-TW summary

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Generic architecture for browser-bound MCP tools over Tailscale trust networks #26

Description

Summary

Problem

Proposed Architecture

1. Browser Host Runtime

2. Local Bridge

3. Tailnet-facing Gateway

4. Trust Network Transport

Why this pattern is valuable

Better security boundary

Better reuse

Better maintainability

Better operational model

Recommended design principles

Expose only high-level tools

Keep the bridge local-only

Add app-layer auth on top of Tailscale

Support policy separation

Possible OpenClaw/Helm relevance

Example deployment shape

Request

zh-TW summary

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions