技能档案

Thinkbrowse Cli

Name: Thinkbrowse Cli
Author: dundas

Control browsers via the ThinkBrowse CLI — navigate pages, interact with elements, extract content, take screenshots. Use when the user asks to browse, scrape, test, audit, or automate anything in a browser using CLI commands. Do NOT use for simple URL fetching (use WebFetch tool instead).

dundas0 星标2026年3月26日

职业
分类: 命令行工具

技能内容

Control real browsers from Claude Code using the thinkbrowse CLI. Two modes:

Local mode — drives your actual Chrome via the ThinkBrowse extension (free, auth cookies available)
Cloud mode — provisions isolated Playwright browsers on Fly.io (credits, headless, stealth)

Quick Start

# Local mode — list tabs, attach, navigate, observe
thinkbrowse tabs
thinkbrowse attach <tabId>
thinkbrowse navigate "https://example.com"
thinkbrowse snapshot              # accessibility tree (best for AI)
thinkbrowse screenshot --output /tmp/page.png

# Cloud mode — create session, navigate, observe, clean up
thinkbrowse cloud start "https://example.com"
thinkbrowse snapshot
thinkbrowse screenshot --output /tmp/page.png
thinkbrowse cloud stop --all

Mode Detection

Signal	Mode
Bridge running + extension connected

相关技能

Thinkbrowse Cli | Skills Pool

thinkbrowse navigate "https://example.com"
sleep 2                                    # Wait for page load + CSRF tokens
thinkbrowse snapshot                       # Read structure before interacting
thinkbrowse screenshot --output /tmp/page.png
# Then use Read tool on /tmp/page.png to view it

thinkbrowse evaluate "[...document.querySelectorAll('button,a,[role=button]')].find(el=>el.textContent.trim()==='Sign In')?.click()"
sleep 1

# React inputs need keyboard events — use `type`, NOT `fill`
thinkbrowse click "input[type=email]"
thinkbrowse type "input[type=email]" "[email protected]"
thinkbrowse click "input[type=password]"
thinkbrowse type "input[type=password]" "password123"
thinkbrowse click "button[type=submit]"
sleep 3

thinkbrowse evaluate "[...document.querySelectorAll('button,[role=tab],[role=button]')].map(b=>({label:b.getAttribute('aria-label'),text:b.textContent.trim(),class:b.className.slice(0,50)})).filter(b=>b.text||b.label)"

thinkbrowse navigate "https://app.example.com"
thinkbrowse wait "button[type=submit]" --timeout 10000
thinkbrowse click "button[type=submit]"

# BAD — "Timeline & Logs" is not a CSS selector
thinkbrowse click "Timeline & Logs"     # ELEMENT_NOT_FOUND

# GOOD — find by text via evaluate
thinkbrowse evaluate "[...document.querySelectorAll('button')].find(b=>b.textContent.includes('Timeline'))?.click()"

# BAD — fill uses native DOM, React won't detect the change
thinkbrowse fill "#search" "query text"

# GOOD — type uses keyboard events, triggers React onChange
thinkbrowse type "#search" "query text"

# BAD — screenshot fails with "fetch failed"
thinkbrowse attach <tabId>
thinkbrowse screenshot --output /tmp/page.png   # ERROR

# GOOD — wait for connection
thinkbrowse attach <tabId>
sleep 2
thinkbrowse screenshot --output /tmp/page.png

# BAD — extract runs before new page loads
thinkbrowse click "a[href='/next-page']"
thinkbrowse extract ".content"           # OLD page content!

# GOOD — wait for new page element
thinkbrowse click "a[href='/next-page']"
thinkbrowse wait "h1" --timeout 5000
thinkbrowse extract ".content"

# ALWAYS release/stop when done
thinkbrowse release                # local mode
thinkbrowse cloud stop --all       # cloud mode

{"success": true, "command": "navigate", "durationMs": 1234, "data": {...}}
{"success": false, "error": "...", "code": "ELEMENT_NOT_FOUND", "hint": "...", "retryable": true}

thinkbrowse navigate "https://site-a.com" --tab 12345
thinkbrowse navigate "https://site-b.com" --tab 67890

thinkbrowse doctor                        # Check bridge, extension, config
thinkbrowse config show                   # Show current config
thinkbrowse config set apiKey <key>       # Set API key for cloud mode
thinkbrowse agent-init [--name <name>]    # Set agent identity

Action	Wait After
`navigate`	`sleep 2` (page load + CSRF)
`click`	`sleep 1` (state change)
`type` / `fill`	No wait needed
`attach` (new tab)	`sleep 2` (connection)
`evaluate` (click)	`sleep 1`
`scroll`	`sleep 1` (rendering)

Thinkbrowse Cli

Quick Start

Mode Detection

Thinkbrowse Cli

Quick Start

Mode Detection

Top 5 Patterns

1. Navigate and Observe

2. Click by Text Content (RECOMMENDED)

3. Fill React Forms

4. List All Interactive Elements

5. Wait Then Act

Top 5 Anti-Patterns

1. Using text as a CSS selector

2. Using `fill` on React inputs

3. Screenshotting immediately after attach

4. Clicking a link without waiting for navigation

5. Not cleaning up sessions

Output Format

Timing Guidelines

Multi-Agent Coordination

Setup & Diagnostics

References

Oracle

Blucli

Peekaboo

Add Dock Band

Add Fallback Commands

Add Adaptive Card Form

Thinkbrowse Cli

Quick Start

Mode Detection

Thinkbrowse Cli

Quick Start

Mode Detection

Top 5 Patterns

1. Navigate and Observe

2. Click by Text Content (RECOMMENDED)

3. Fill React Forms

4. List All Interactive Elements

5. Wait Then Act

Top 5 Anti-Patterns

1. Using text as a CSS selector

2. Using fill on React inputs

3. Screenshotting immediately after attach

4. Clicking a link without waiting for navigation

5. Not cleaning up sessions

Output Format

Timing Guidelines

Multi-Agent Coordination

Setup & Diagnostics

References

Oracle

Blucli

Peekaboo

Add Dock Band

Add Fallback Commands

Add Adaptive Card Form

2. Using `fill` on React inputs