Universal App Bridge (UAB)

Smart function discovery and framework-level desktop app control for AI agents.

UAB doesn't just automate apps — it discovers, identifies, learns, and remembers how to control every application on your system. The first time it sees an app, it figures out what framework it uses, which control method works best, and stores that knowledge for instant recall. Every subsequent interaction is faster and smarter.

The runtime also exposes its discovery model directly. Embedding systems can inspect the registered framework hooks, framework-detection signatures, Concerto method inventory, and per-operation control plans instead of treating UAB as a black box.

One-Click Install

UAB ships as a packaged installer. Run it once and every AI agent on the machine gets native desktop control.

# GUI installer (recommended)
cd installer && npm install && npx electron src/main.js

# CLI install (for terminal users)
uab-bridge install

The installer:

Starts UABServer as a system service (auto-starts on boot)
Installs the Chrome extension for browser bridge
Writes skill files for Claude Co-work AND Claude Code
Generates an API key for authenticated access
Detects host network for VM accessibility

Works with: Claude Co-work, Claude Code CLI, Claude Code Desktop, and any agent that can make HTTP calls.

The Core Innovation: Smart Function Discovery

Most automation tools require you to know what app you're controlling and how to connect. UAB figures it out for you:

        ┌──────────────────────────────────────────────────────────┐
        │              Smart Function Discovery                     │
        │                                                          │
        │  1. SCAN ─────────► DLL module scanning                  │
        │     "What's running?"   Batch process enumeration        │
        │                         Window title fetching            │
        │                                   │                      │
        │  2. IDENTIFY ─────► Framework signature matching         │
        │     "What framework?"   electron.exe → Electron          │
        │                         qt6core.dll  → Qt6               │
        │                         xlcall32.dll → Office            │
        │                         jvm.dll      → Java              │
        │                                   │                      │
        │  3. REGISTER ─────► In-memory Map + JSON persistence     │
        │     "Remember this"     O(1) lookup by PID or name       │
        │                         Dual-indexed (exe + PID)         │
        │                         Git-friendly registry.json       │
        │                                   │                      │
        │  4. CONNECT ──────► Plugin cascade with fallback         │
        │     "Best method?"      CDP → COM → UIA (automatic)     │
        │                         Preferred method remembered      │
        │                                   │                      │
        │  5. LEARN ────────► Update registry with results         │
        │     "Next time faster"  Store preferred control method   │
        │                         Cache element trees              │
        │                         Track connection health          │
        └──────────────────────────────────────────────────────────┘

What Makes This "Smart"?

| Traditional Automation | UAB Smart Discovery |
| --- | --- |
| You specify the app and how to connect | UAB scans the system and finds everything automatically |
| Hard-coded framework assumptions | DLL scanning identifies the exact framework with confidence scores |
| No memory between sessions | Registry persists knowledge in JSON for instant recall next time |
| Single control method | Cascade tries best method first, falls back automatically |
| Manual configuration per app | Zero-config: scan once, control anything |

Quick Start

As a Library

import { UABConnector } from 'universal-app-bridge';

const uab = new UABConnector();
await uab.start();

// 1. SCAN — Discover everything running
const apps = await uab.scan();
// → 79 apps found, frameworks identified, profiles registered

// 2. FIND — Smart lookup (registry first, live detection fallback)
const excel = await uab.find('excel');
// → Instant hit from registry (O(1) Map lookup)

// 3. CONNECT — Best method selected automatically
const conn = await uab.connect('excel');
// → { pid: 5678, name: 'EXCEL', framework: 'office', method: 'office-com+uia', elementCount: 342 }

// 4. QUERY — Search the UI tree
const buttons = await uab.query(conn.pid, { type: 'button', label: 'Save' });

// 5. ACT — Perform actions (permission-checked, retried, cache-aware)
await uab.act(conn.pid, buttons[0].id, 'click');

// Next session: find() and connect() resolve from registry.json without a fresh scan
await uab.stop();

As a CLI (for any AI agent)

The CLI outputs pure JSON — designed for Claude, GPT, or any agent calling via bash:

# Scan and register all running apps
uab scan
# → { "success": true, "apps": [...79 apps with frameworks...] }

# List known apps from registry (instant, no scan needed)
uab apps
# → Instant recall from registry.json

# Smart search — registry first, live detection fallback
uab find "notepad"

# Connect with automatic method selection
uab connect notepad
# → { "pid": 1234, "method": "win-uia", "elementCount": 15 }

# Query and act
uab query 1234 --type button --label "Save"
uab act 1234 btn_42 click

# Registry persists between sessions — next time is instant
uab profiles
# → Shows all known apps with framework info and preferred methods

As an HTTP Server (for remote / server-side agents)

Run UAB as a REST API so agents on other machines, in containers, or in cloud environments can control desktop apps remotely:

# Start the server (localhost only)
uab serve --port 3100

# Listen on all interfaces (for VM or remote access)
uab serve --port 3100 --host 0.0.0.0

# With authentication (recommended for non-localhost)
uab serve --port 3100 --host 0.0.0.0 --api-key my-secret-key

# From any HTTP client or remote agent:
curl -X POST http://localhost:3100/scan
curl -X POST http://localhost:3100/find -d '{"query":"notepad"}'
curl -X POST http://localhost:3100/connect -d '{"target":"notepad"}'
curl -X POST http://localhost:3100/query -d '{"pid":1234,"selector":{"type":"button"}}'
curl -X POST http://localhost:3100/act -d '{"pid":1234,"elementId":"btn_1","action":"click"}'
curl -X POST http://localhost:3100/plan -d '{"pid":1234,"action":"hotkey"}'
curl -X POST http://localhost:3100/open -d '{"target":"notepad"}'
curl -X POST http://localhost:3100/focus -d '{"pid":1234}'
curl -X POST http://localhost:3100/describe -d '{"pid":1234}'

# P6 — OS raw input injection for spatial gestures
curl -X POST http://localhost:3100/drag -d '{"pid":1234,"path":[{"x":100,"y":200},{"x":300,"y":200}],"button":"left"}'
curl -X POST http://localhost:3100/scroll -d '{"pid":1234,"x":500,"y":400,"amount":3}'

# Health check
curl http://localhost:3100/health

// Or programmatically:
import { UABServer } from 'universal-app-bridge/server';

const server = new UABServer({ port: 3100, host: '0.0.0.0', apiKey: 'secret' });
await server.start();
// Clients POST JSON to /scan, /connect, /query, /act, /open, /focus, /describe, etc.
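
For agents that call the server from code rather than curl, a request helper might look like the sketch below. It assumes the server accepts JSON bodies and reads the key from an `X-API-Key` header (the header used in the `/deep-query` curl examples in this README). `buildUabRequest` and its return shape are hypothetical names for illustration, not part of the UAB API.

```typescript
// Hypothetical helper: builds the URL and options for a UAB HTTP call.
// Assumes JSON request bodies and an X-API-Key header for authentication.
interface UabRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

function buildUabRequest(
  baseUrl: string,
  path: string,
  body: Record<string, unknown> = {},
  apiKey?: string,
): UabRequest {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  if (apiKey) headers["X-API-Key"] = apiKey; // only sent when a key is configured
  // Strip trailing slashes so "http://host:3100/" + "/connect" does not double up.
  const url = baseUrl.replace(/\/+$/, "") + path;
  return { url, init: { method: "POST", headers, body: JSON.stringify(body) } };
}

// Usage: hand the pieces to any HTTP client, e.g. fetch(connectReq.url, connectReq.init).
const connectReq = buildUabRequest(
  "http://localhost:3100/",
  "/connect",
  { target: "notepad" },
  "my-secret-key",
);
```

Keeping request construction separate from the network call makes it easy to log or unit-test what an agent is about to send.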

Environment Auto-Detection

UAB automatically detects its runtime context and tunes behavior accordingly:

| Environment | Session | Persistence | Rate Limit | Extension Bridge |
| --- | --- | --- | --- | --- |
| Desktop | Session 1+ | Persistent connections | 100/min/PID | Enabled |
| Server | Session 0 (SSH/service) | Stateless | 60/min/PID | Disabled |
| Container | Docker/WSL | Stateless | 30/min/PID | Disabled |

# Check what UAB detected:
uab env
# → { "environment": { "mode": "desktop", "hasDesktop": true, ... }, "defaults": { ... } }

ONE codebase, ZERO configuration — UAB figures out where it's running and adapts.
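
As a rough illustration of the table above, the per-environment defaults can be read as a lookup keyed by detected mode. The values are copied from the table; the field names and the `classifyEnvironment` rule are assumptions made for this sketch, and the real detection logic may weigh more signals.

```typescript
type EnvMode = "desktop" | "server" | "container";

interface EnvDefaults {
  persistentConnections: boolean;
  rateLimitPerMinPerPid: number;
  extensionBridge: boolean;
}

// Values taken from the environment table; field names are illustrative.
const ENV_DEFAULTS: Record<EnvMode, EnvDefaults> = {
  desktop: { persistentConnections: true, rateLimitPerMinPerPid: 100, extensionBridge: true },
  server: { persistentConnections: false, rateLimitPerMinPerPid: 60, extensionBridge: false },
  container: { persistentConnections: false, rateLimitPerMinPerPid: 30, extensionBridge: false },
};

// Hypothetical classifier: a container wins over the session check,
// and Windows Session 0 means there is no interactive desktop.
function classifyEnvironment(sessionId: number, inContainer: boolean): EnvMode {
  if (inContainer) return "container";
  return sessionId === 0 ? "server" : "desktop";
}

const detectedDefaults = ENV_DEFAULTS[classifyEnvironment(0, false)];
```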

Architecture

Agent Runtime (Claude / GPT / Any AI Agent)
         │
    Library API  or  CLI (JSON)  or  HTTP Server (REST)
         │
┌────────┴───────────────────────────────────────────────────┐
│              Universal App Bridge (UAB)                      │
│                                                             │
│  ┌─────────────┐  ┌────────────┐  ┌─────────────────────┐  │
│  │  Smart       │  │  App       │  │   UAB Connector     │  │
│  │  Detector    │  │  Registry  │  │   (Public API)      │  │
│  │             │  │  (Brain)   │  │                     │  │
│  │ DLL scan    │  │ Map + JSON │  │ scan() find()       │  │
│  │ Batch enum  │  │ O(1) lookup│  │ connect() query()   │  │
│  │ Signatures  │  │ Persist    │  │ act() state()       │  │
│  └──────┬──────┘  └─────┬──────┘  └──────────┬──────────┘  │
│         │               │                     │             │
│         └───────────────┼─────────────────────┘             │
│                         │                                   │
│  ┌──────────────────────┴────────────────────────────────┐  │
│  │                  Plugin Manager                        │  │
│  │  ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ │  │
│  │  │Chrome Ext│ │ Browser  │ │ Electron │ │  Office  │ │  │
│  │  │  (WS)    │ │  (CDP)   │ │  (CDP)   │ │(COM+UIA) │ │  │
│  │  └──────────┘ └──────────┘ └──────────┘ └──────────┘ │  │
│  │  ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ │  │
│  │  │   Qt     │ │   GTK    │ │  Java    │ │ Flutter  │ │  │
│  │  │  (UIA)   │ │  (UIA)   │ │(JAB→UIA) │ │  (UIA)   │ │  │
│  │  └──────────┘ └──────────┘ └──────────┘ └──────────┘ │  │
│  │  ┌──────────┐ ┌──────────┐                              │  │
│  │  │ Win-UIA  │ │  Vision  │                              │  │
│  │  │ (A11y)   │ │(AI last  │                              │  │
│  │  │          │ │ resort)  │                              │  │
│  │  └──────────┘ └──────────┘                              │  │
│  └───────────────────────────────────────────────────────┘  │
│                                                             │
│  ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────────┐   │
│  │  Cache   │ │Permission│ │  Retry   │ │ Chain Engine │   │
│  │ (3-tier) │ │ (Audit)  │ │(Backoff) │ │ (Workflows)  │   │
│  └──────────┘ └──────────┘ └──────────┘ └──────────────┘   │
│                                                             │
│  ┌──────────────────┐  ┌────────────────────────────────┐   │
│  │ Control Router   │  │  Connection Manager            │   │
│  │ (Cascade+Fallback│  │  (Health+Reconnect+Cleanup)    │   │
│  └──────────────────┘  └────────────────────────────────┘   │
└─────────────────────────────────────────────────────────────┘
         │
    Operating System (CDP, UIA, COM, PowerShell, WMI)
         │
    Desktop Applications

The Cascade Pattern

UAB picks the best control method for each operation automatically. The standalone runtime now exposes this through a Concerto inventory plus per-operation planning. Each micro-operation uses the most efficient method based on speed, outcome quality, control precision, and cost:

Priority 1: Direct API / MCP endpoint (when the app exposes one)
Priority 2: Chrome Extension Bridge (browsers — no relaunch needed)
Priority 3: Browser CDP (browsers — with debug flag)
Priority 4: Framework Hook (Electron CDP, Office COM, Qt/GTK/Java/Flutter hook wrappers)
Priority 5: Windows UI Automation (win-uia fallback — any windowed app)
Priority 6: Keyboard Native (shortcuts, hotkeys, text input — fastest for commands)
Priority 7: OS Raw Input Injection (drag, scroll, gestures — SendInput/CGEventPost/xdotool)
     Vision Analysis: Screenshot + AI (reading state, verifying results — the agent's eyes)

The cascade does not pick one method per app; it picks the right method for each operation. A single Blender sculpting session uses keyboard for commands (Ctrl+Tab, Ctrl+4), drag for brush strokes, scroll for zooming, and screenshots for verification: four methods in one workflow. That's the concerto.
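
To make per-operation selection concrete, here is a minimal sketch. The operation names, the preference lists, and `planOperationSketch` are all hypothetical; they only illustrate the idea of choosing a method per operation and are not the planner behind `planOperation()`.

```typescript
type Method =
  | "direct-api" | "chrome-extension" | "browser-cdp" | "framework-hook"
  | "win-uia" | "keyboard" | "raw-input" | "vision";

type Operation = "click-element" | "hotkey" | "drag" | "scroll" | "verify";

// Illustrative preference lists, best method first.
const PREFERENCES: Record<Operation, Method[]> = {
  "click-element": ["direct-api", "chrome-extension", "browser-cdp", "framework-hook", "win-uia", "vision"],
  hotkey: ["keyboard"],
  drag: ["raw-input"],
  scroll: ["raw-input", "keyboard"],
  verify: ["framework-hook", "win-uia", "vision"],
};

// Returns the first preferred method that is actually available for this app.
function planOperationSketch(op: Operation, available: ReadonlySet<Method>): Method | undefined {
  return PREFERENCES[op].find((method) => available.has(method));
}

// A canvas-heavy app with no accessibility tree: only keyboard, raw input, and vision work.
const blenderMethods = new Set<Method>(["keyboard", "raw-input", "vision"]);
const blenderOps: Operation[] = ["hotkey", "drag", "scroll", "verify"];
const blenderPlan = blenderOps.map((op) => planOperationSketch(op, blenderMethods));
```

Each operation in the session resolves to a different method even though the target app never changes.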

P6 (OS Raw Input Injection) injects mouse drag, scroll, and gesture events directly into the OS input stream: SendInput() on Windows, with CGEventPost and xdotool as the macOS and Linux counterparts. Applications receive these events as if a human had moved the mouse. This enables sculpting in Blender, painting in Photoshop, and drawing in any canvas app, all of which require continuous held-button mouse movement.

Smart Discovery Deep Dive

The standalone runtime now exposes this contract directly:

hookInventory() / GET /info.frameworkHooks — all registered framework hooks
signatureInventory() / GET /info.frameworkSignatures — framework detection signatures
concertoInventory() / GET /info.concertoMethods — operation-level method inventory
planOperation() / POST /plan — per-operation Concerto planning

Phase 1: Detection

UAB scans the system using three batched PowerShell calls (not per-process — batched for speed):

WMI Process Enumeration — Get all running processes with PIDs, names, paths, command lines
Batch DLL Module Scan — One PowerShell call scans loaded modules for ALL processes (batches of 50)
Batch Window Title Scan — One P/Invoke call via EnumWindows gets all visible window titles

Result: Full system scan in 2-5 seconds. Finds 79+ controllable apps on a typical Windows desktop.
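
The "batches of 50" step can be pictured as a chunking pass over the PID list, with one module-scan call issued per batch. The `chunk` helper below is a generic sketch with made-up PIDs, not UAB's scanner code.

```typescript
// Splits a list into consecutive batches of at most `size` items.
function chunk<T>(items: readonly T[], size: number): T[][] {
  if (!Number.isInteger(size) || size <= 0) {
    throw new RangeError("size must be a positive integer");
  }
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

// 120 hypothetical PIDs become three batches: 50, 50, and 20.
const pids = Array.from({ length: 120 }, (_, i) => 1000 + i);
const pidBatches = chunk(pids, 50);
```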

Phase 2: Framework Identification

Each detected process is matched against framework signatures:

// Example: How UAB identifies an Electron app
{
  framework: 'electron',
  modules: ['electron.exe', 'libcef.dll', 'chrome_elf.dll', 'v8.dll'],
  filePatterns: ['resources/app.asar', 'resources/app.asar.unpacked'],
  commandLine: ['--type=renderer', 'electron', 'app.asar'],
  baseConfidence: 0.9
}

Confidence accumulates: base score + module matches + command-line matches + file pattern matches. An Electron app loading chrome_elf.dll AND having resources/app.asar gets confidence 0.95.
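
A minimal scoring sketch, assuming the first kind of matching evidence yields the base confidence and each additional kind adds a fixed bonus. The README only states that confidence accumulates and gives the 0.95 example; the 0.05 bonus, the cap at 1.0, and the `ProcessEvidence` shape are assumptions chosen to reproduce that figure.

```typescript
interface FrameworkSignature {
  framework: string;
  modules: string[];
  filePatterns: string[];
  commandLine: string[];
  baseConfidence: number;
}

interface ProcessEvidence {
  modules: string[];   // loaded module names, lowercase
  files: string[];     // paths found relative to the executable
  commandLine: string; // full command line of the process
}

// Hypothetical weight: every evidence kind beyond the first adds 0.05.
const EXTRA_EVIDENCE_BONUS = 0.05;

function scoreSignature(sig: FrameworkSignature, ev: ProcessEvidence): number {
  const loaded = new Set(ev.modules);
  const files = new Set(ev.files);
  const moduleHit = sig.modules.some((m) => loaded.has(m));
  const fileHit = sig.filePatterns.some((p) => files.has(p));
  const cmdHit = sig.commandLine.some((c) => ev.commandLine.includes(c));
  const hits = [moduleHit, fileHit, cmdHit].filter(Boolean).length;
  if (hits === 0) return 0; // no evidence at all: not this framework
  const score = sig.baseConfidence + (hits - 1) * EXTRA_EVIDENCE_BONUS;
  // Round to three decimals to hide floating-point noise, then cap at 1.
  return Math.min(1, Math.round(score * 1000) / 1000);
}

const electronSignature: FrameworkSignature = {
  framework: "electron",
  modules: ["electron.exe", "libcef.dll", "chrome_elf.dll", "v8.dll"],
  filePatterns: ["resources/app.asar", "resources/app.asar.unpacked"],
  commandLine: ["--type=renderer", "electron", "app.asar"],
  baseConfidence: 0.9,
};

// Module match plus file match, no command-line match.
const vsCodeScore = scoreSignature(electronSignature, {
  modules: ["chrome_elf.dll", "kernel32.dll"],
  files: ["resources/app.asar"],
  commandLine: "code.exe",
});
```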

10 framework signatures built in: Electron, Qt5, Qt6, GTK3, GTK4, WPF, .NET, Flutter, Java, Office.

Plus fast-path detection for browsers (Chrome, Edge, Brave) and Office apps (Word, Excel, PowerPoint, Outlook) by executable name.

Phase 3: Registry & Persistence

Every detected app is registered in the App Registry — UAB's brain:

// What the registry stores per app
interface AppProfile {
  executable: string;       // Stable key: "code.exe"
  name: string;             // "Visual Studio Code"
  pid: number;              // Last known PID
  framework: FrameworkType; // "electron"
  confidence: number;       // 0.95
  preferredMethod: string;  // "browser-cdp", "office-com+uia", "win-uia", etc.
  path: string;             // Full executable path
  windowTitle: string;      // "project - Visual Studio Code"
  lastSeen: number;         // Unix timestamp
  tags: string[];           // User-defined categorization
}

The registry uses dual-indexed Maps for O(1) lookups:

Map<executable, AppProfile> — lookup by executable name
Map<pid, executable> — lookup by PID → executable → profile

JSON persistence: The entire registry is saved to data/uab-profiles/registry.json — a single, git-friendly file with readable diffs. No database required.
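
The dual-index idea can be sketched in a few lines. `RegistrySketch` and its method names are hypothetical and cover only a subset of the `AppProfile` fields; the real `AppRegistry` may differ, for example in how it handles an app that restarts with a new PID.

```typescript
interface AppProfileSketch {
  executable: string;
  name: string;
  pid: number;
  framework: string;
  preferredMethod: string;
}

class RegistrySketch {
  private readonly byExecutable = new Map<string, AppProfileSketch>();
  private readonly pidToExecutable = new Map<number, string>();

  register(profile: AppProfileSketch): void {
    const key = profile.executable.toLowerCase();
    const previous = this.byExecutable.get(key);
    // Drop the stale PID entry when an app comes back with a new PID.
    if (previous && previous.pid !== profile.pid) {
      this.pidToExecutable.delete(previous.pid);
    }
    this.byExecutable.set(key, profile);
    this.pidToExecutable.set(profile.pid, key);
  }

  getByExecutable(executable: string): AppProfileSketch | undefined {
    return this.byExecutable.get(executable.toLowerCase());
  }

  // Two O(1) hops: PID -> executable key -> profile.
  getByPid(pid: number): AppProfileSketch | undefined {
    const key = this.pidToExecutable.get(pid);
    return key === undefined ? undefined : this.byExecutable.get(key);
  }

  // Sorted, indented output keeps diffs of the persisted file small.
  serialize(): string {
    const profiles = Array.from(this.byExecutable.values());
    profiles.sort((a, b) => a.executable.localeCompare(b.executable));
    return JSON.stringify(profiles, null, 2);
  }
}

const registry = new RegistrySketch();
registry.register({
  executable: "code.exe",
  name: "Visual Studio Code",
  pid: 12345,
  framework: "electron",
  preferredMethod: "electron-cdp",
});
```

Only the executable map is serialized, since PIDs are not stable across sessions; the PID index can be rebuilt from the stored profiles on load.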

Phase 4: Smart Lookup

When you call find("excel"), UAB doesn't scan the system again. It:

Checks the registry first — O(1) Map lookup, case-insensitive substring match
Returns instantly if found (< 1ms)
Only falls back to live detection if not in registry

This is why the first scan() takes 2-5 seconds, but every subsequent find() is instant.
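
A sketch of the registry-first lookup, under the assumption that an exact key hit is tried before the substring match and that live detection is passed in as a callback. `findSketch` is hypothetical and synchronous for brevity; the real `find()` is asynchronous.

```typescript
interface KnownApp {
  executable: string; // registry key, e.g. "excel.exe"
  name: string;
}

interface FindResult {
  source: "registry" | "live";
  apps: KnownApp[];
}

function findSketch(
  query: string,
  registry: ReadonlyMap<string, KnownApp>,
  detectLive: (query: string) => KnownApp[],
): FindResult {
  const needle = query.toLowerCase();

  // Fast path: exact key hit, O(1).
  const exact = registry.get(needle) ?? registry.get(needle + ".exe");
  if (exact) return { source: "registry", apps: [exact] };

  // Looser path: case-insensitive substring match, linear in registry size.
  const partial = Array.from(registry.values()).filter(
    (app) =>
      app.executable.toLowerCase().includes(needle) ||
      app.name.toLowerCase().includes(needle),
  );
  if (partial.length > 0) return { source: "registry", apps: partial };

  // Nothing known yet: pay for live detection.
  return { source: "live", apps: detectLive(query) };
}

const known = new Map<string, KnownApp>([
  ["excel.exe", { executable: "excel.exe", name: "Microsoft Excel" }],
]);
let liveCalls = 0;
const excelHit = findSketch("excel", known, () => {
  liveCalls++;
  return [];
});
```

Note that only the exact-key path is constant time; the substring path scans the known profiles, which is still far cheaper than a system scan.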

Phase 5: Learning

After each successful connection, UAB updates the registry with what worked:

// After connecting to VS Code via CDP:
registry.update('code.exe', {
  preferredMethod: 'electron-cdp',  // Remember the exact working method
  pid: 12345,                // Update last known PID
  lastSeen: Date.now()       // Update timestamp
});

Next time you connect to VS Code, UAB tries CDP first because it learned that's the best method.

Supported Frameworks

| Framework | Plugin | Method | Apps Covered |
| --- | --- | --- | --- |
| Direct API apps | DirectApiPlugin | HTTP/JSON | Apps that expose a local control endpoint |
| Chrome/Edge/Brave | ChromeExtPlugin | `chrome-extension` | Any Chromium browser: tabs, cookies, DOM, storage, JS exec |
| Chrome/Edge/Brave | BrowserPlugin | `browser-cdp` | Same browsers, requires `--remote-debugging-port` |
| Electron | ElectronPlugin | `electron-cdp` | VS Code, Slack, Discord, Notion, Obsidian, Spotify, Teams |
| MS Office | OfficePlugin | `office-com+uia` | Word, Excel, PowerPoint, Outlook |
| Qt 5/6 | QtPlugin | `qt-uia` | VLC, Telegram Desktop, OBS Studio, VirtualBox, Wireshark |
| GTK 3/4 | GtkPlugin | `gtk-uia` | GIMP, Inkscape, GNOME apps |
| WPF/.NET / Win32 | WinUIAPlugin | `win-uia` | Windows enterprise apps, Visual Studio, generic fallback |
| Flutter | FlutterPlugin | `flutter-uia` | Google apps, Ubuntu desktop apps |
| Java Swing/FX | JavaPlugin | `java-jab-uia` | JetBrains IDEs, Android Studio |

Unified API

Every framework plugin maps its native UI tree into the same types:

`uab.scan()` — Discover & Register

const apps = await uab.scan();
// Apps are detected, frameworks identified, and profiles registered
// Registry persists to disk — next session starts with full knowledge

`uab.find(name)` — Smart Lookup

const results = await uab.find('slack');
// 1. Checks registry (instant) → returns if found
// 2. Falls back to live detection → registers result

`uab.connect(target)` — Auto-Connect

// By name (searches registry, then live-detects)
const conn = await uab.connect('notepad');

// By PID (checks registry, auto-detects if not found)
const connByPid = await uab.connect(1234);

// Returns: { pid, name, framework, method, elementCount }

`uab.enumerate(pid)` — List UI Elements

const tree = await uab.enumerate(pid);
// Cached for 5 seconds — repeated calls are instant

`uab.query(pid, selector)` — Search Elements

const btns = await uab.query(pid, { type: 'button', label: 'Save' });
// Cached for 3 seconds, auto-invalidated after mutating actions

`uab.act(pid, elementId, action, params?)` — Perform Actions

await uab.act(pid, 'btn_1', 'click');
await uab.act(pid, 'input_3', 'type', { text: 'Hello' });
// Permission-checked → retried on transient failure → cache invalidated

Production Hardening

Smart Three-Tier Cache

┌──────────────────────────────────────────┐
│              Element Cache               │
│                                          │
│  Tree Cache    │  5s TTL per PID         │
│  Query Cache   │  3s TTL, 50 max/PID    │
│  State Cache   │  2s TTL per PID         │
│                                          │
│  Auto-invalidation on mutating actions:  │
│  click, type, keypress, navigate, etc.   │
│                                          │
│  Safe (no invalidation):                 │
│  focus, hover, scroll, screenshot, etc.  │
└──────────────────────────────────────────┘
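
One tier of this cache can be sketched as a TTL map keyed by PID with an invalidation hook. `TtlCacheSketch` is a hypothetical name, the mutating-action set contains only the four actions named in the diagram, and the clock is injectable so the expiry logic can be exercised without waiting.

```typescript
// Actions the diagram names as mutating; the real list is longer ("etc.").
const MUTATING_ACTIONS = new Set(["click", "type", "keypress", "navigate"]);

class TtlCacheSketch<V> {
  private readonly entries = new Map<number, { value: V; expiresAt: number }>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  get(pid: number): V | undefined {
    const entry = this.entries.get(pid);
    if (!entry) return undefined;
    if (this.now() >= entry.expiresAt) {
      this.entries.delete(pid); // expired: evict lazily on read
      return undefined;
    }
    return entry.value;
  }

  set(pid: number, value: V): void {
    this.entries.set(pid, { value, expiresAt: this.now() + this.ttlMs });
  }

  // Call after every action; only mutating actions drop the cached entry.
  onAction(pid: number, action: string): void {
    if (MUTATING_ACTIONS.has(action)) this.entries.delete(pid);
  }
}

// Tree cache with the 5-second TTL from the diagram and a fake clock.
let fakeNow = 0;
const treeCache = new TtlCacheSketch<string[]>(5000, () => fakeNow);
treeCache.set(1234, ["btn_1", "input_3"]);
```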

Permission & Safety Model

Risk classification: safe / moderate / destructive
Rate limiting: 100 actions/min per PID by default on desktop (60 on server, 30 in containers; configurable)
Audit log: Last 1000 actions with timestamps, PIDs, elements, risk levels
Destructive action gating: close requires explicit confirmation when blocking is enabled
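
Per-PID rate limiting can be sketched as a sliding-window log. The README does not say which algorithm UAB uses, so the sliding window, the class name, and the explicit `nowMs` parameter are assumptions for illustration.

```typescript
class RateLimiterSketch {
  private readonly hits = new Map<number, number[]>();
  private readonly limit: number;
  private readonly windowMs: number;

  constructor(limit = 100, windowMs = 60000) {
    this.limit = limit;
    this.windowMs = windowMs;
  }

  // Returns true and records the action if the PID is under its limit.
  tryAcquire(pid: number, nowMs: number): boolean {
    const previous = this.hits.get(pid) ?? [];
    // Keep only timestamps that are still inside the window.
    const recent = previous.filter((t) => nowMs - t < this.windowMs);
    const allowed = recent.length < this.limit;
    if (allowed) recent.push(nowMs);
    this.hits.set(pid, recent);
    return allowed;
  }
}

// Tiny limit so the behavior is easy to follow: 2 actions per second per PID.
const limiter = new RateLimiterSketch(2, 1000);
const decisions = [
  limiter.tryAcquire(1, 0),
  limiter.tryAcquire(1, 10),
  limiter.tryAcquire(1, 20),   // third action inside the window is rejected
  limiter.tryAcquire(2, 20),   // a different PID has its own budget
  limiter.tryAcquire(1, 1000), // the action at t=0 has aged out
];
```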

Health Monitoring

30-second health check intervals
Auto-reconnect with exponential backoff (1s → 2s → 4s → 8s)
Stale connection cleanup after 5 minutes of failure
Event callbacks for connection state changes

Retry with Backoff

Exponential backoff with 0-30% jitter
Retryable error detection (ECONNRESET, timeout, EPIPE, socket hang up)
Per-operation timeout with configurable limits
Labeled operations for debugging
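
The delay schedule and the retryable-error check can be sketched as two small functions. Treating jitter as additive (0 to 30% on top of the exponential delay) and matching on the error message text are assumptions; the random source is injectable so the result is reproducible.

```typescript
// Substrings named in the list above as retryable.
const RETRYABLE_PATTERNS = ["ECONNRESET", "timeout", "EPIPE", "socket hang up"];

function isRetryable(error: unknown): boolean {
  const message = error instanceof Error ? error.message : String(error);
  return RETRYABLE_PATTERNS.some((pattern) => message.includes(pattern));
}

// Delay before retry number `attempt` (0-based): base * 2^attempt plus 0-30% jitter.
function backoffDelayMs(
  attempt: number,
  baseMs = 1000,
  random: () => number = Math.random,
): number {
  const exponential = baseMs * Math.pow(2, attempt);
  const jitter = exponential * 0.3 * random();
  return Math.round(exponential + jitter);
}

// With jitter disabled this reproduces the 1s, 2s, 4s, 8s reconnect schedule.
const scheduleWithoutJitter = [0, 1, 2, 3].map((attempt) => backoffDelayMs(attempt, 1000, () => 0));
```

Jitter spreads retries from several connections that failed at the same moment, so they do not all hit the target again in lockstep.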

Action Chains

Multi-step workflows with verification between steps:

const chain = {
  name: 'fill-form',
  pid: 1234,
  steps: [
    { type: 'action', selector: { label: 'Name' }, action: 'type', params: { text: 'John' } },
    { type: 'wait', selector: { type: 'button', label: 'Submit' }, timeoutMs: 5000 },
    { type: 'action', selector: { label: 'Submit' }, action: 'click' },
  ],
};

const result = await chainExecutor.execute(chain);

Chrome Extension Bridge

UAB includes a Chrome Extension (Manifest V3) that connects to your running browser via WebSocket — no browser relaunch required.

┌────────────────────┐    WebSocket     ┌────────────────────┐
│   UAB Service      │◄───(port 8787)──►│  Chrome Extension  │
│   (Node.js)        │    JSON protocol │  (Manifest V3)     │
└────────────────────┘                  └────────────────────┘

Full browser control: Tabs, cookies, localStorage, sessionStorage, navigation, JavaScript execution, screenshots — all without relaunching the browser.

Co-work Bridge

UAB works seamlessly with Claude Co-work. The installer writes skill files directly into Co-work's plugin directory. Co-work reaches UABServer through Chrome's localhost access — no port forwarding, no configuration.

The Chrome extension acts as a relay: Co-work → Chrome extension → localhost:3100 → UABServer → desktop apps.

Recursive Application Bridge

UAB doesn't just control apps — it learns how to control them better with every interaction.

The Flow Library (data/flow-library/) stores pre-built interaction sequences for every app UAB has successfully controlled. Each flow captures the exact steps, input method, and known quirks discovered through real-world testing:

ChatGPT: 1 Tab → type → Enter
Grok: 2 Tabs → keystroke activate → clipboard paste → Enter
Excel: COM API methods (no UI automation needed)
Notepad: Direct SendKeys type

When an agent encounters a new app, it checks GET /flow/{appname}. If a flow exists, the agent follows it mechanically — zero exploration, zero guessing. If no flow exists, UAB provides a framework-based default, and the agent saves the working sequence via POST /flow after success.

This creates a recursive improvement loop: Attempt → Verify → Learn → Store → Next attempt is instant. Unlike human muscle memory that degrades over time, the flow library is permanent, exact, and shared across every agent connected to UAB.
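
The loop can be sketched with a store interface standing in for the `GET /flow/{appname}` and `POST /flow` endpoints. The `Flow` shape, the `FlowStore` interface, and `runWithFlowSketch` are hypothetical; the actual payloads are not documented in this README.

```typescript
interface Flow {
  app: string;
  steps: string[];
}

// Stand-in for the HTTP flow endpoints; a real client would call them instead.
interface FlowStore {
  get(app: string): Flow | undefined;
  save(flow: Flow): void;
}

function runWithFlowSketch(
  app: string,
  store: FlowStore,
  defaultSteps: string[],
  execute: (steps: string[]) => boolean, // returns true once the result is verified
): { usedStoredFlow: boolean; succeeded: boolean } {
  const stored = store.get(app);
  const steps = stored ? stored.steps : defaultSteps;
  const succeeded = execute(steps);
  // Persist only sequences that were verified and are not stored yet.
  if (succeeded && !stored) store.save({ app, steps });
  return { usedStoredFlow: stored !== undefined, succeeded };
}

// In-memory store for the example.
const flowMap = new Map<string, Flow>();
const memoryStore: FlowStore = {
  get: (app) => flowMap.get(app),
  save: (flow) => {
    flowMap.set(flow.app, flow);
  },
};

const firstRun = runWithFlowSketch("notepad", memoryStore, ["focus", "type"], () => true);
const secondRun = runWithFlowSketch("notepad", memoryStore, ["focus", "type"], () => true);
```

The first run falls back to the default steps and stores them after success; the second run finds the stored flow and skips exploration.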

X-ray Vision for Agents

UAB gives AI agents the same visual understanding of applications that humans have — but in data form.

POST /deep-query scans the entire UI tree of any application and returns every named element — buttons, inputs, links, menus, text — with their types, supported actions, and screen positions. One call reveals everything a human can see.

POST /invoke acts on any element by name. Find "Copy" → click it. Find "New chat" → click it. No Tab navigation, no coordinate guessing, no screenshots needed.

# See everything in ChatGPT
curl -X POST localhost:3100/deep-query -H "X-API-Key: KEY" -d '{"pid":28968}'
# → 123 elements: buttons, links, inputs, conversations, model selector...

# Click any button by name
curl -X POST localhost:3100/invoke -H "X-API-Key: KEY" -d '{"pid":28968, "name":"Copy", "occurrence":"last"}'
# → Invokes the last Copy button, returns clipboard text

Anti-Screenshot SDK (v1.2)

UAB v1.2 eliminates the need for screenshots in most desktop automation tasks.

Spatial Maps organize UI elements into rows and columns — the agent sees the app layout as structured data instead of pixels. One call replaces a screenshot + vision API analysis.
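
One way to derive rows and columns from bounding rectangles is to group elements by vertical center and sort each group by x. The grouping rule and the 10-pixel tolerance below are assumptions for illustration, not the algorithm behind `desktop_spatial_map`.

```typescript
interface UiElement {
  name: string;
  x: number;
  y: number;
  width: number;
  height: number;
}

function centerY(el: UiElement): number {
  return el.y + el.height / 2;
}

// Elements whose vertical centers lie within `tolerance` pixels of the
// first element in a row are placed in that row; rows are sorted left to right.
function buildSpatialMap(elements: readonly UiElement[], tolerance = 10): UiElement[][] {
  const sorted = elements.slice().sort((a, b) => centerY(a) - centerY(b));
  const rows: { anchorY: number; items: UiElement[] }[] = [];
  for (const el of sorted) {
    const cy = centerY(el);
    const last = rows[rows.length - 1];
    if (last && Math.abs(cy - last.anchorY) <= tolerance) {
      last.items.push(el);
    } else {
      rows.push({ anchorY: cy, items: [el] });
    }
  }
  return rows.map((row) => row.items.sort((a, b) => a.x - b.x));
}

const spatialMap = buildSpatialMap([
  { name: "Save", x: 200, y: 100, width: 60, height: 20 },
  { name: "Edit", x: 50, y: 2, width: 40, height: 20 },
  { name: "Open", x: 100, y: 98, width: 60, height: 20 },
  { name: "File", x: 0, y: 0, width: 40, height: 20 },
]);
const rowNames = spatialMap.map((row) => row.map((el) => el.name));
```

Here the menu items land in the first row and the toolbar buttons in the second, each ordered left to right.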

Composite Engine combines UIA tree + bounding rects + text reading in speed-priority order. Vision becomes a last resort, not the primary method.

MCP Server exposes 17 native tools (desktop_scan, desktop_smart_click, desktop_spatial_map, etc.) that AI agents discover automatically. No skill files and no curl commands are needed; the tools just appear.

Atomic Chains execute multi-step action sequences in a single PowerShell session — no focus stealing between steps. Solves the menu timing problem.

Smart Invoke tries 6 methods to click any element: InvokePattern → SetFocus → ValuePattern → ExpandCollapse → coordinate click → parent invoke.
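
The fallback order can be sketched as a list of strategies tried in sequence until one reports success. `smartInvokeSketch` and the `InvokeStrategy` shape are hypothetical, and the strategy bodies here are stubs standing in for the real UI Automation calls.

```typescript
interface InvokeStrategy {
  name: string;
  attempt: () => boolean; // true when the element was activated
}

function smartInvokeSketch(
  strategies: readonly InvokeStrategy[],
): { method: string | null; tried: string[] } {
  const tried: string[] = [];
  for (const strategy of strategies) {
    tried.push(strategy.name);
    try {
      if (strategy.attempt()) return { method: strategy.name, tried };
    } catch {
      // A throwing strategy counts as a failure; move on to the next one.
    }
  }
  return { method: null, tried };
}

// Stubbed run: the first two strategies fail, the third succeeds.
const invokeResult = smartInvokeSketch([
  { name: "InvokePattern", attempt: () => false },
  { name: "SetFocus", attempt: () => { throw new Error("element not focusable"); } },
  { name: "ValuePattern", attempt: () => true },
  { name: "ExpandCollapse", attempt: () => true },
]);
```

Returning the list of attempted methods alongside the winner gives the caller something to log and, potentially, to remember for the next invocation.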

MCP Setup

UAB exposes 17 native desktop control tools via the Model Context Protocol. Any MCP-compatible agent gets instant access to desktop_scan, desktop_spatial_map, desktop_invoke, desktop_flow, and more — no skill files or HTTP calls needed.

The UAB installer configures MCP automatically for Claude Desktop and Claude Code. For other agents, follow the instructions below.

Claude Desktop (auto-configured by installer)

The installer writes to %APPDATA%\Claude\claude_desktop_config.json (Windows) or ~/Library/Application Support/Claude/claude_desktop_config.json (macOS):

{
  "mcpServers": {
    "desktop-control": {
      "command": "node",
      "args": ["/path/to/uab/dist/mcp-server.js"]
    }
  }
}

Restart Claude Desktop after installation. The desktop control tools appear automatically in both chat and code mode.

Claude Code (auto-configured by installer)

The installer adds the MCP permission to ~/.claude/settings.json. To add manually via CLI:

claude mcp add desktop-control node /path/to/uab/dist/mcp-server.js

Cursor

Add to Cursor's MCP settings (Settings > MCP Servers > Add):

{
  "command": "node",
  "args": ["/path/to/uab/dist/mcp-server.js"]
}

Windsurf / Other Editors

Add to your editor's MCP configuration:

{
  "mcpServers": {
    "desktop-control": {
      "command": "node",
      "args": ["/path/to/uab/dist/mcp-server.js"]
    }
  }
}

Generic MCP Client

Any agent that supports MCP stdio transport can connect:

Command: node
Args: ["/path/to/uab/dist/mcp-server.js"]
Transport: stdio (JSON-RPC 2.0 over stdin/stdout)
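
For a hand-rolled client, the messages written to the server's stdin are newline-delimited JSON-RPC 2.0 requests. The sketch below frames `tools/list` and `tools/call`, which are standard MCP methods; passing an empty `arguments` object to `desktop_scan` is an assumption, and a real client must complete the MCP `initialize` handshake before sending either.

```typescript
interface JsonRpcRequest {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params?: Record<string, unknown>;
}

let nextId = 1;

// Serializes one request as a single line; stdio framing is one JSON message per line.
function frameRequest(method: string, params?: Record<string, unknown>): string {
  const request: JsonRpcRequest = { jsonrpc: "2.0", id: nextId++, method };
  if (params !== undefined) request.params = params;
  return JSON.stringify(request) + "\n";
}

// Ask the server which tools it offers, then call one of them.
const listToolsMessage = frameRequest("tools/list");
const callScanMessage = frameRequest("tools/call", {
  name: "desktop_scan",
  arguments: {}, // assumed: desktop_scan needs no arguments
});
```

Each line would be written to the spawned `node /path/to/uab/dist/mcp-server.js` process, and responses are matched back to requests by `id`.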

Available MCP Tools

| Tool | Description |
| --- | --- |
| `desktop_scan` | Discover all running GUI applications |
| `desktop_connect` | Connect to an app by name or PID |
| `desktop_spatial_map` | Full UI layout as structured rows/columns (RawViewWalker) |
| `desktop_deep_query` | X-ray: find ALL elements including inner Electron web content |
| `desktop_invoke` | Directly activate named element by best method |
| `desktop_flow` | Get learned interaction flow for specific apps |
| `desktop_smart_click` | Click by name with 6-method fallback cascade |
| `desktop_chain` | Atomic multi-step action sequence (no focus loss) |
| `desktop_keypress` | Send keyboard key |
| `desktop_hotkey` | Send keyboard shortcut |
| `desktop_act` | Click, type, select, expand, invoke by element ID |
| `desktop_ui_tree` | Get UI element tree |
| `desktop_find_elements` | Find elements by type/label |
| `desktop_window` | Window management (minimize, maximize, etc.) |
| `desktop_state` | Get app state and window properties |
| `desktop_focused` | Get currently focused element |
| `desktop_apps` | List previously discovered apps (instant) |

Session 0 Bridge

UAB works even when running in Session 0 (SSH, Windows Services). It automatically detects Session 0 and routes PowerShell through the Task Scheduler with /IT flag to bridge to the interactive desktop session.

Documentation

| Document | What's Inside |
| --- | --- |
| ARCHITECTURE.md | Smart discovery pipeline, cascade routing, plugin architecture, data flow |
| GETTING_STARTED.md | Install → scan → discover → connect → control walkthrough |
| API_REFERENCE.md | Every method, parameter, and return type for UABConnector & AppRegistry |
| SUPPORTED_APPLICATIONS.md | Tested apps with specific operations and benchmarks |
| SECURITY.md | Trust boundaries, permission model, audit trail |
| CONTRIBUTING.md | How to contribute, write plugins, code standards |
| CHANGELOG.md | Version history |

Key Numbers

| Metric | Value |
| --- | --- |
| Framework plugins | 9 (Electron, Browser, Office, Qt, GTK, Java, Flutter, Chrome Extension, Win-UIA) |
| Framework signatures | 10 (Electron, Qt5, Qt6, GTK3, GTK4, WPF, .NET, Flutter, Java, Office) |
| Element types | 32 normalized types |
| Action types | 61 (UI + keyboard + window + Office + browser) |
| CLI commands | 20+ (all JSON output) |
| Source files | 30 TypeScript files (~11,700 LOC) |
| Apps detected | 79+ on typical Windows desktop |
| Registry lookup | O(1) via dual-indexed Maps |

Why UAB Matters

The person who solves reliable, universal app control for agents unlocks the entire "AI operating system" vision without needing anyone's permission. No waiting for app developers to build APIs. No begging SaaS companies for MCP servers. No fragile pixel-scraping.

Smart Function Discovery is the key. Any agent can scan a system, learn what's running, and control it — all with zero configuration. The registry remembers everything across sessions, making each interaction faster than the last.

Hook into the framework, own the interface.

Requirements

Node.js >= 18.0.0
Windows (primary platform — UIA, COM, PowerShell)
Linux/macOS support via framework-specific plugins

Environment Variables

| Variable | Default | Description |
| --- | --- | --- |
| `UAB_LOG_LEVEL` | `info` | Log level: `debug`, `info`, `warn`, `error` |
| `UAB_LOG_FILE` | (none) | Optional file path for log output |
| `LOG_LEVEL` | `info` | Fallback log level (if UAB_LOG_LEVEL not set) |

License

Universal App Bridge is licensed under the Business Source License 1.1.

Permitted: Personal use, academic research, evaluation, testing, open source projects.

Requires commercial license: Commercial agent runtimes, SaaS platforms, enterprise internal use (25+ employees), competing products, and deployments to 5+ users/devices.

Patent notice: This software is subject to pending patent applications. The Change Date license conversion does not grant patent rights beyond those stated in the License.

Each version converts to Apache 2.0 four years after release.

See LICENSE for full terms.
