Name: Skill: dexterous-spatial-ui-manipulation
Author: Dingxingdi

Skill: dexterous-spatial-ui-manipulation

Use this skill when the user wants tasks requiring precise physical interaction across coordinates, such as drag-and-drop, selecting specific text spans, adjusting sliders, or drawing/arranging elements based on spatial constraints. Trigger it for requests like “highlight the second sentence,” “select the paragraph starting with 'For',” “move the file to the folder,” “set the slider to 75%,” “draw a circle inside the square,” or “draw a 3x3 grid.” This skill is for GUI tasks where the agent must resolve spatial start/end points, compute geometric bounding boxes, or maintain continuous motion control to manipulate UI elements precisely.

Dingxingdi0 starsApr 10, 2026

Occupation
Categories: Game Development

1. Capability Definition & Real Case

Professional Definition: The capability to perform spatially-aware, continuous physical interactions within a GUI, calculating and executing actions involving multi-point trajectories (drag-and-drop), precise 2D coordinate resolutions (drawing geometric shapes within relational bounds), and area-based selections (character-level text highlighting). This skill requires the agent to move beyond discrete clicking to compute relative positioning, coordinate-based intersections, and visual text-snapping mechanisms across dense or blank-canvas environments.
Dimension Hierarchy: GUI Perception and Environment Modeling->Element Grounding->dexterous-spatial-ui-manipulation

Real Case

[Case 1]

Initial Environment: A document editor is open showing a text-dense page with multiple paragraphs and sentences. The interface context is a scoped application window focus.
Real Question: Drag to select the second sentence of the first paragraph.
Real Trajectory: Locate the first paragraph block, identify the punctuation marking the end of the first sentence, calculate the precise starting coordinate (x,y) at the beginning of the second sentence's first word, initiate a drag action, move the cursor to the coordinate (x',y') following the last word of the second sentence, and release.

Skill: dexterous-spatial-ui-manipulation

Dingxingdi0 starsApr 10, 2026

Occupation
Categories: Game Development

1. Capability Definition & Real Case

Professional Definition: The capability to perform spatially-aware, continuous physical interactions within a GUI, calculating and executing actions involving multi-point trajectories (drag-and-drop), precise 2D coordinate resolutions (drawing geometric shapes within relational bounds), and area-based selections (character-level text highlighting). This skill requires the agent to move beyond discrete clicking to compute relative positioning, coordinate-based intersections, and visual text-snapping mechanisms across dense or blank-canvas environments.

Dimension Hierarchy: GUI Perception and Environment Modeling->Element Grounding->dexterous-spatial-ui-manipulation

Real Case

[Case 1]

Initial Environment: A document editor is open showing a text-dense page with multiple paragraphs and sentences. The interface context is a scoped application window focus.

Real Question: Drag to select the second sentence of the first paragraph.

Real Trajectory: Locate the first paragraph block, identify the punctuation marking the end of the first sentence, calculate the precise starting coordinate (x,y) at the beginning of the second sentence's first word, initiate a drag action, move the cursor to the coordinate (x',y') following the last word of the second sentence, and release.

Skill: dexterous-spatial-ui-manipulation

1. Capability Definition & Real Case

Real Case

Skill: dexterous-spatial-ui-manipulation

1. Capability Definition & Real Case

Real Case

Pipeline Execution Instructions

Prose

Golang Patterns

Audiocraft Audio Generation

Pokemon Player

Ideation

Storybook Upgrade