GUI automation via visual detection: clicking, typing, reading content, navigating menus, and filling forms, all through a screenshot → detect → act workflow. Supports macOS and Linux.
Before any GUI operation, run:

`python3 {baseDir}/scripts/activate.py`
This detects your OS, sets up the correct action commands, and outputs platform context.
After running, {baseDir}/actions/_actions.yaml contains your platform's commands.
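A minimal sketch of consuming that file, assuming a flat `action: command` layout (the example entries and the layout itself are illustrative assumptions, not the actual `_actions.yaml` schema):

```python
# Hypothetical sketch: load the platform command map that activate.py
# writes to actions/_actions.yaml. The flat "action: command" layout
# and the sample entries below are assumptions, not the real format.

def load_actions(yaml_text: str) -> dict[str, str]:
    """Parse a flat 'key: value' YAML subset into a dict."""
    actions = {}
    for line in yaml_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        key, _, value = line.partition(":")  # split at first colon only
        actions[key.strip()] = value.strip()
    return actions

# Example file contents (illustrative only; macOS-style commands)
sample = """
# _actions.yaml (example)
screenshot: screencapture -x /tmp/screen.png
click: cliclick c:{x},{y}
type: cliclick t:{text}
"""

actions = load_actions(sample)
print(actions["click"].format(x=100, y=200))  # cliclick c:100,200
```

In practice a full YAML parser would be used; the point is only that after activation the agent reads one mapping of abstract actions to concrete platform commands.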
OBSERVE → LEARN → ACT → VERIFY → SAVE
OBSERVE — Take screenshot → run OCR + detector → understand current state
→ read {baseDir}/skills/gui-observe/SKILL.md
LEARN — First time with an app? Save components to memory
→ read {baseDir}/skills/gui-learn/SKILL.md
→ learn_from_screenshot() auto-outputs app tips if available
ACT — Pick target → execute using _actions.yaml commands → verify
→ read {baseDir}/skills/gui-act/SKILL.md
→ read {baseDir}/actions/_actions.yaml for available commands
VERIFY — Screenshot again → confirm action succeeded
SAVE — Record state transitions to memory
→ read {baseDir}/skills/gui-memory/SKILL.md for memory structure
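The loop above can be sketched as a retry-until-verified step, where `observe()`, `act()`, and `verify()` are hypothetical stand-ins for "screenshot + OCR/detector", "execute an `_actions.yaml` command", and "screenshot again + confirm":

```python
# Minimal sketch of the OBSERVE → ACT → VERIFY loop. The three
# callables are hypothetical placeholders, not real APIs from this
# skill; only the retry shape of the workflow is shown.

def run_step(observe, act, verify, max_attempts=3):
    """Retry an action until verification succeeds or attempts run out."""
    for attempt in range(1, max_attempts + 1):
        state = observe()   # screenshot → OCR + detector → current state
        act(state)          # click/type based on the detected state
        if verify():        # screenshot again → confirm the action landed
            return attempt
    raise RuntimeError(f"action did not verify after {max_attempts} attempts")

# Toy usage: verification succeeds on the second attempt.
calls = {"n": 0}

def fake_verify():
    calls["n"] += 1
    return calls["n"] >= 2

attempts = run_step(lambda: {}, lambda s: None, fake_verify)
print(attempts)  # 2
```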
| Sub-Skill | When to read |
|---|---|
| skills/gui-observe/SKILL.md | Before screenshots or detection |
| skills/gui-learn/SKILL.md | Before learning a new app |
| skills/gui-act/SKILL.md | Before any click/type action |
| skills/gui-memory/SKILL.md | For memory structure details |
| skills/gui-workflow/SKILL.md | For multi-step navigation |
| skills/gui-setup/SKILL.md | For first-time machine setup |
| skills/gui-report/SKILL.md | For task performance reporting |
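The table above is a static lookup, so it can be kept as data. A hypothetical helper (the function name and phase keys are assumptions; the paths come from the table):

```python
# Hypothetical helper: resolve which sub-skill doc to read for a
# given workflow phase. Paths are taken from the table above; the
# phase names and the helper itself are illustrative assumptions.
SKILL_DOCS = {
    "observe": "skills/gui-observe/SKILL.md",
    "learn": "skills/gui-learn/SKILL.md",
    "act": "skills/gui-act/SKILL.md",
    "memory": "skills/gui-memory/SKILL.md",
    "workflow": "skills/gui-workflow/SKILL.md",
    "setup": "skills/gui-setup/SKILL.md",
    "report": "skills/gui-report/SKILL.md",
}

def skill_doc(base_dir: str, phase: str) -> str:
    """Return the full path of the SKILL.md to read for a phase."""
    return f"{base_dir}/{SKILL_DOCS[phase]}"

print(skill_doc("/opt/gui", "act"))  # /opt/gui/skills/gui-act/SKILL.md
```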