Name: Desktop Computer Automation
Author: fdsasaaa

Desktop Computer Automation

Vision-driven desktop automation using Midscene. Control your desktop (macOS, Windows, Linux) with natural language commands. Operates entirely from screenshots — no DOM or accessibility labels required. Can interact with all visible elements on screen regardless of technology stack. Triggers: open app, press key, desktop, computer, click on screen, type text, screenshot desktop, launch application, switch window, desktop automation, control computer, mouse click, keyboard shortcut, screen capture, find on screen, read screen, verify window, close app, minimize window, maximize window Powered by Midscene.js (https://midscenejs.com)

fdsasaaa0 星標2026年3月8日

職業
分類: CLI 工具

CRITICAL RULES — VIOLATIONS WILL BREAK THE WORKFLOW:

Never run midscene commands in the background. Each command must run synchronously so you can read its output (especially screenshots) before deciding the next action. Background execution breaks the screenshot-analyze-act loop.

Run only one midscene command at a time. Wait for the previous command to finish, read the screenshot, then decide the next action. Never chain multiple commands together.

Allow enough time for each command to complete. Midscene commands involve AI inference and screen interaction, which can take longer than typical shell commands. A typical command needs about 1 minute; complex act commands may need even longer.

Always report task results before finishing. After completing the automation task, you MUST proactively summarize the results to the user — including key data found, actions completed, screenshots taken, and any relevant findings. Never silently end after the last automation step; the user expects a complete response in a single interaction.

Control your desktop (macOS, Windows, Linux) using . Each CLI command maps directly to an MCP tool — you (the AI agent) act as the brain, deciding which actions to take based on screenshots.

Desktop Computer Automation

fdsasaaa0 星標2026年3月8日

職業
分類: CLI 工具

CRITICAL RULES — VIOLATIONS WILL BREAK THE WORKFLOW:

Never run midscene commands in the background. Each command must run synchronously so you can read its output (especially screenshots) before deciding the next action. Background execution breaks the screenshot-analyze-act loop.

Run only one midscene command at a time. Wait for the previous command to finish, read the screenshot, then decide the next action. Never chain multiple commands together.

Allow enough time for each command to complete. Midscene commands involve AI inference and screen interaction, which can take longer than typical shell commands. A typical command needs about 1 minute; complex act commands may need even longer.

Always report task results before finishing. After completing the automation task, you MUST proactively summarize the results to the user — including key data found, actions completed, screenshots taken, and any relevant findings. Never silently end after the last automation step; the user expects a complete response in a single interaction.

Control your desktop (macOS, Windows, Linux) using . Each CLI command maps directly to an MCP tool — you (the AI agent) act as the brain, deciding which actions to take based on screenshots.

Desktop Computer Automation

Desktop Computer Automation

Prerequisites

Commands

Connect to Desktop

List Displays

Take Screenshot

Perform Action

Disconnect

Workflow Pattern

Best Practices

Troubleshooting

macOS: Accessibility Permission Denied

macOS: Xcode Command Line Tools Not Found

API Key Not Set

macOS: Screenshot Fails with `system_profiler` Not Found

AI Cannot Find the Element

Oracle

Blucli

Peekaboo

Add Dock Band

Add Fallback Commands

Add Adaptive Card Form

Desktop Computer Automation

Desktop Computer Automation

Prerequisites

Commands

Connect to Desktop

List Displays

Take Screenshot

Perform Action

Disconnect

Workflow Pattern

Best Practices

Troubleshooting

macOS: Accessibility Permission Denied

macOS: Xcode Command Line Tools Not Found

API Key Not Set

macOS: Screenshot Fails with system_profiler Not Found

AI Cannot Find the Element

Oracle

Blucli

Peekaboo

Add Dock Band

Add Fallback Commands

Add Adaptive Card Form

macOS: Screenshot Fails with `system_profiler` Not Found