Browser automation via MCP tools. ALWAYS use these tools for ANY web task - navigating sites, clicking, typing, filling forms, taking screenshots, or extracting data. This is the ONLY way to control the browser.
Browser automation using MCP tools. Use these tools directly for all web automation tasks.
browser_navigate(url, page_name?) - Navigate to a URL
browser_snapshot(page_name?) - Get the page's accessibility tree
browser_click(x?, y?, ref?, selector?, page_name?) - Click on the page
browser_type(ref?, selector?, text, press_enter?, page_name?) - Type into an input
browser_screenshot(page_name?, full_page?) - Take a screenshot
browser_evaluate(script, page_name?) - Run custom JavaScript
browser_pages(action, page_name?) - Manage pages
browser_keyboard(text?, key?, page_name?) - Type to the focused element
browser_sequence(actions, page_name?) - Execute multiple actions efficiently
browser_navigate("google.com")browser_snapshot() - find refs like [ref=e5]browser_click(ref="e5") or browser_type(ref="e3", text="search query", press_enter=true)browser_screenshot() to see the resultIMPORTANT: For Google Docs/Sheets/Slides, ALWAYS navigate directly - don't click through Drive UI (new tabs don't work well):
Direct URLs to remember:
When you encounter a login page (e.g., Google Sign-In, OAuth screens, authentication prompts):
This interactive login flow is essential because:
For saving/downloading content: