claw-relay

Control a remote browser through Claw Relay. Use when you need to navigate authenticated websites, click buttons, fill forms, take screenshots, or read page content on a user's real browser — especially when the agent runs on a different machine (cloud, container, server) than the browser. Triggers on remote browser control, authenticated browsing, real browser, cookie-based access, browser relay.

AndreaGriffiths11 1 Updated 3mo ago

Install

npx skillscat add andreagriffiths11/claw-relay

Install via the SkillsCat registry.

SKILL.md

Claw Relay — Remote Browser Control

You control a real Chrome browser through a WebSocket relay. The browser runs on the user's machine with their real cookies and sessions. You run anywhere.

Connection

Connect via WebSocket. Auth first, then send actions.

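const ws = new WebSocket('wss://<relay-url>');

// First message must be auth. Send it only once the socket is open;
// calling send() while the socket is still connecting throws.
ws.addEventListener('open', () => {
  ws.send(JSON.stringify({
    type: 'auth',
    token: '<agent-token>',
    agent_id: '<your-agent-id>'
  }));
});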

The relay URL and token are provided by the user or set as environment variables:

CLAW_RELAY_URL — WebSocket URL (e.g. wss://relay.example.com)
CLAW_RELAY_TOKEN — agent auth token
CLAW_RELAY_AGENT — your agent identifier
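
As a rough sketch of how the three variables above could feed the auth step, the helper below reads them from an environment-like object and produces the relay URL plus the serialized auth message. The name `buildAuthMessage` and the fail-fast check for missing values are illustrative assumptions, not part of Claw Relay itself.

```javascript
// Hypothetical helper: turns environment-style settings into the auth message.
// `env` is any object shaped like process.env.
function buildAuthMessage(env) {
  const required = ['CLAW_RELAY_URL', 'CLAW_RELAY_TOKEN', 'CLAW_RELAY_AGENT'];
  const missing = required.filter((name) => !env[name]);
  if (missing.length > 0) {
    // Assumption: failing early is preferable to sending an incomplete auth message.
    throw new Error(`Missing relay settings: ${missing.join(', ')}`);
  }
  return {
    url: env.CLAW_RELAY_URL,
    message: JSON.stringify({
      type: 'auth',
      token: env.CLAW_RELAY_TOKEN,
      agent_id: env.CLAW_RELAY_AGENT,
    }),
  };
}
```

In use, something like `const { url, message } = buildAuthMessage(process.env)` would run first, and `message` would then be sent from the socket's `open` handler.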

Actions

After auth succeeds, send actions as JSON:

Action	Scope	Payload
`snapshot`	`read`	`{"type": "snapshot"}` — returns accessibility tree
`screenshot`	`read`	`{"type": "screenshot"}` — returns base64 PNG via WebSocket
`click`	`interact`	`{"type": "click", "ref": "e5"}`
`type`	`interact`	`{"type": "type", "ref": "e3", "text": "hello"}`
`fill`	`interact`	`{"type": "fill", "ref": "e3", "text": "hello"}`
`press`	`interact`	`{"type": "press", "key": "Enter"}`
`hover`	`interact`	`{"type": "hover", "ref": "e2"}`
`select`	`interact`	`{"type": "select", "ref": "e7", "values": ["opt1"]}`
`navigate`	`navigate`	`{"type": "navigate", "url": "https://..."}`
`evaluate`	`execute`	`{"type": "evaluate", "js": "document.title"}`
`close`	`navigate`	`{"type": "close"}` — closes current tab
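
One possible way to avoid malformed payloads is to build them through a small helper that knows which fields each action needs. The field lists below are transcribed from the table above; the helper name `buildAction` and its validation behavior are assumptions made for illustration.

```javascript
// Required payload fields per action type, taken from the actions table.
const ACTION_FIELDS = {
  snapshot: [],
  screenshot: [],
  click: ['ref'],
  type: ['ref', 'text'],
  fill: ['ref', 'text'],
  press: ['key'],
  hover: ['ref'],
  select: ['ref', 'values'],
  navigate: ['url'],
  evaluate: ['js'],
  close: [],
};

// Hypothetical helper: returns the JSON string to send over the WebSocket.
function buildAction(type, fields = {}) {
  const required = ACTION_FIELDS[type];
  if (!required) {
    throw new Error(`Unknown action type: ${type}`);
  }
  const missing = required.filter((name) => fields[name] === undefined);
  if (missing.length > 0) {
    throw new Error(`Action '${type}' is missing: ${missing.join(', ')}`);
  }
  return JSON.stringify({ type, ...fields });
}
```

For example, `buildAction('click', { ref: 'e5' })` yields the same payload as the `click` row in the table.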

Responses

{"type": "result", "action": "snapshot", "ok": true, "data": "...accessibility tree..."}
{"type": "result", "action": "screenshot", "ok": true, "data": "<base64-encoded-png>", "mimeType": "image/png"}
{"type": "error", "code": "permission_denied", "message": "Agent lacks 'interact' scope"}
{"type": "error", "code": "site_blocked", "message": "mail.google.com is blocked"}
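
A minimal sketch of how an agent might sort incoming messages into results and errors, based only on the four examples above. The function name `parseRelayMessage` and the `kind` field are hypothetical. Any message type not shown here (for instance whatever the relay sends after auth) is passed through untouched rather than guessed at.

```javascript
// Hypothetical helper: classifies one raw WebSocket message from the relay.
function parseRelayMessage(raw) {
  const msg = JSON.parse(raw);
  if (msg.type === 'error') {
    return { kind: 'error', code: msg.code, message: msg.message };
  }
  if (msg.type === 'result') {
    return {
      kind: 'result',
      action: msg.action,
      ok: msg.ok === true,
      data: msg.data,
      // Only screenshot results carry a mimeType in the examples above.
      mimeType: msg.mimeType,
    };
  }
  // Assumption: unknown shapes are returned as-is for the caller to inspect.
  return { kind: 'other', raw: msg };
}
```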

Workflow Pattern

Snapshot first — read the page structure before acting
Find elements by ref — the accessibility tree assigns refs (e.g. e1, e5) to interactive elements
Act on refs — click, type, fill using the ref from the snapshot
Snapshot again — verify the page changed as expected
Repeat — navigate → snapshot → act → verify

snapshot → find button ref → click ref → snapshot → verify
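
The loop above could be sketched as follows. Everything transport-related is injected, because this document specifies neither the accessibility tree format nor how responses are matched to requests. `sendAction`, `findRef`, and `verify` are hypothetical callbacks supplied by the caller, and `clickAndVerify` is an illustrative name.

```javascript
// sendAction: async (actionObject) => parsed relay response (assumed shape: { ok, data }).
// findRef:    (treeText) => ref string such as 'e7', or null if not found.
// verify:     (treeText) => boolean, true when the page changed as expected.
async function clickAndVerify(sendAction, findRef, verify) {
  // 1. Snapshot first.
  const before = await sendAction({ type: 'snapshot' });
  if (before.ok !== true) {
    return { clicked: false, reason: 'snapshot_failed' };
  }
  // 2. Find the element by ref.
  const ref = findRef(before.data);
  if (!ref) {
    return { clicked: false, reason: 'ref_not_found' };
  }
  // 3. Act on the ref.
  const click = await sendAction({ type: 'click', ref });
  if (click.ok !== true) {
    return { clicked: false, reason: 'click_failed' };
  }
  // 4. Snapshot again and verify.
  const after = await sendAction({ type: 'snapshot' });
  return { clicked: true, ref, verified: after.ok === true && verify(after.data) };
}
```

Injecting the callbacks keeps the sequence itself testable without a live relay.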

Scopes (least privilege)

Request only what you need:

read — snapshot, screenshot. Start here.
interact — click, type, fill, press, hover, select. Adds the ability to change things.
navigate — go to URLs and close the current tab. Can access any allowed site as the logged-in user.
execute — run JavaScript on the page. Nuclear option. Avoid unless necessary.
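
To make "request only what you need" concrete, here is a sketch that derives the smallest scope set for a planned list of actions. The action-to-scope mapping comes from the actions table in this document; the helper name `scopesFor` is an assumption.

```javascript
// Scope required by each action type, per the actions table.
const ACTION_SCOPE = {
  snapshot: 'read',
  screenshot: 'read',
  click: 'interact',
  type: 'interact',
  fill: 'interact',
  press: 'interact',
  hover: 'interact',
  select: 'interact',
  navigate: 'navigate',
  evaluate: 'execute',
  close: 'navigate',
};

// Hypothetical helper: returns the minimal scopes for a plan, ordered from
// least to most powerful.
function scopesFor(actionTypes) {
  const needed = new Set();
  for (const type of actionTypes) {
    const scope = ACTION_SCOPE[type];
    if (!scope) {
      throw new Error(`Unknown action type: ${type}`);
    }
    needed.add(scope);
  }
  return ['read', 'interact', 'navigate', 'execute'].filter((s) => needed.has(s));
}
```

A plan of `['navigate', 'snapshot', 'fill', 'press']` would need `read`, `interact`, and `navigate`, but not `execute`.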

Security Constraints

Allowlist — your agent can only access sites explicitly allowed in its config
Blocklist — banking, email, and auth providers are always blocked regardless of allowlist
Rate limiting — actions are rate-limited per agent (token bucket)
Audit log — every action is logged with timestamps, agent ID, action, target, and result
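
The precedence between the two lists can be illustrated with a small check in which the blocklist always wins. This is only a sketch of the rule as described here, not the relay's implementation. The list format (bare hostnames) and the subdomain matching are assumptions.

```javascript
// Hypothetical check: allowlist and blocklist are arrays of hostnames.
function checkSite(url, allowlist, blocklist) {
  const host = new URL(url).hostname;
  // Assumption: an entry matches the host itself and any of its subdomains.
  const matches = (list) =>
    list.some((entry) => host === entry || host.endsWith(`.${entry}`));

  // The blocklist is checked first, so it overrides the allowlist.
  if (matches(blocklist)) {
    return { allowed: false, code: 'site_blocked', reason: 'blocklist' };
  }
  if (!matches(allowlist)) {
    return { allowed: false, code: 'site_blocked', reason: 'not_in_allowlist' };
  }
  return { allowed: true };
}
```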

What Makes This Different

Local browser tools require agent and browser on the same machine. Claw Relay doesn't. Your agent runs anywhere — cloud, server, container — and controls the user's real browser remotely. Real cookies, real sessions, real logins. No headless browser, no fake profiles.

Setup

Users set up the relay on their machine:

npx @acolombiadev/claw-relay

This launches Chrome, generates a config with random tokens, and starts the relay, all from a single command.

Common Tasks

Read a page

{"type": "navigate", "url": "https://github.com/notifications"}
// wait for response
{"type": "snapshot"}

Fill and submit a form

{"type": "snapshot"}
// find input ref from tree, e.g. e3
{"type": "fill", "ref": "e3", "text": "search query"}
{"type": "press", "key": "Enter"}

Click a button

{"type": "snapshot"}
// find button ref, e.g. e7
{"type": "click", "ref": "e7"}

Error Handling

permission_denied — you lack the required scope. Ask the user to upgrade your agent config.
site_blocked — the site is on the global blocklist (cannot be overridden) or not in your allowlist. Ask the user to check the config.
rate_limited — slow down. Wait and retry.
engine_error — browser or CDP issue. The page may have navigated or the element may be stale. Re-snapshot and retry.
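
The guidance above might translate into a small decision function like the one below. The four error codes are from this document. The step names, the retry cap, and the exponential backoff delays are illustrative assumptions, since the relay's actual rate-limit parameters are not stated here.

```javascript
// Hypothetical helper: decides what to do after an error response.
// attempt is zero-based: 0 means the first failure.
function planRecovery(code, attempt = 0, maxAttempts = 3) {
  switch (code) {
    case 'permission_denied':
    case 'site_blocked':
      // Retrying cannot help; the user has to change the agent config.
      return { step: 'ask_user' };
    case 'rate_limited':
      if (attempt >= maxAttempts) return { step: 'give_up' };
      // Assumed backoff: 1s, 2s, 4s, ...
      return { step: 'wait_and_retry', delayMs: 1000 * 2 ** attempt };
    case 'engine_error':
      if (attempt >= maxAttempts) return { step: 'give_up' };
      // Refs may be stale, so take a fresh snapshot before retrying.
      return { step: 'resnapshot_and_retry' };
    default:
      return { step: 'give_up' };
  }
}
```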
