Name: Skill: action-and-motion-understanding
Author: Dingxingdi

Skill: action-and-motion-understanding

Use this skill when the user wants questions about what someone did, what happens next, which way something moved, or whether one movement is faster or slower than another. It also applies when two clips look almost identical unless the motion is watched carefully. Trigger it for requests like 'make questions about tiny movement differences', 'ask what happened right after', 'which way did it go', 'who moved faster', or 'make the answer depend on the action sequence rather than the still image'.

Dingxingdi, 0 stars, April 8, 2026

Occupation
Category: Frontend

1. Capability Definition & Real Case

Professional Definition: The capability to infer what agents or objects are doing from temporally evolving evidence, including fine-grained action type, relative motion, motion direction, and immediate next-step dynamics that cannot be resolved from a single frame.
Dimension Hierarchy: Temporal-Spatial Understanding -> Dynamic Event Perception -> action-and-motion-understanding
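
Illustrative Sketch: The definition above says that motion direction and relative speed cannot be resolved from a single frame. One way to make that concrete is the minimal Python sketch below. It assumes some upstream tracker (not described in this skill) already provides timestamped object positions in image coordinates. The function name `motion_summary` and the sample tracks are hypothetical and serve only as an illustration.

```python
import math


def motion_summary(track):
    """Summarize one object's motion.

    track: list of (t_seconds, x, y) samples in time order, in image
    coordinates where y grows downward (an assumption of this sketch).
    Returns (dominant_direction, average_speed_in_pixels_per_second).
    """
    if len(track) < 2:
        # A single frame carries no motion information at all.
        raise ValueError("motion needs at least two timestamped samples")

    (t0, x0, y0), (t1, x1, y1) = track[0], track[-1]
    duration = t1 - t0
    if duration <= 0:
        raise ValueError("samples must be in strictly increasing time order")

    # Net displacement decides the dominant direction.
    dx, dy = x1 - x0, y1 - y0
    if dx == 0 and dy == 0:
        direction = "stationary"
    elif abs(dx) >= abs(dy):
        direction = "right" if dx > 0 else "left"
    else:
        direction = "down" if dy > 0 else "up"

    # Path length (not net displacement) decides the average speed.
    path_length = sum(
        math.hypot(bx - ax, by - ay)
        for (_, ax, ay), (_, bx, by) in zip(track, track[1:])
    )
    return direction, path_length / duration


# Hypothetical tracks for two objects in the same clip.
track_a = [(0.0, 0, 0), (1.0, 3, 4)]
track_b = [(0.0, 10, 0), (2.0, 4, 0)]
dir_a, speed_a = motion_summary(track_a)
dir_b, speed_b = motion_summary(track_b)
faster = "a" if speed_a > speed_b else "b"
```

A "who moved faster" question can then be checked by comparing the two speeds, and a "which way did it go" question by reading the direction label. Passing a one-sample track raises an error, which mirrors the single-frame limitation.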

Real Case

[Case 1]

Initial Environment: A 16-second third-person kitchen clip. A person places a food box on the counter, reaches toward a cabinet, and slightly shifts their body orientation before touching the handle. There are no subtitles, and the background objects stay almost unchanged throughout the clip.
Real Question: What will the person do next?
Real Trajectory: Observe the hand trajectory after the box is placed, note that the person reorients toward the cabinet rather than the table, and verify that the motion continues into a handle-reaching gesture rather than a pickup or turning-away action.
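
Illustrative Sketch: The trajectory above checks which object the hand keeps moving toward. The Python sketch below shows one possible way to express that check. It assumes per-frame hand positions and fixed object positions are already available from some detector. The function name `approached_target`, the target names, and all coordinates are hypothetical and are not taken from the original clip.

```python
import math


def approached_target(hand_track, targets):
    """Return the target the hand steadily closes in on, or None.

    hand_track: list of (x, y) hand positions in time order.
    targets: dict mapping a target name to its (x, y) position.
    A target qualifies only if the hand-to-target distance never
    increases across the track; among qualifying targets, the one
    with the largest total distance reduction wins.
    """
    if len(hand_track) < 2:
        raise ValueError("need at least two hand positions")

    best_name, best_closing = None, 0.0
    for name, (tx, ty) in targets.items():
        dists = [math.hypot(hx - tx, hy - ty) for hx, hy in hand_track]
        closing = dists[0] - dists[-1]
        steadily_closer = all(b <= a for a, b in zip(dists, dists[1:]))
        if steadily_closer and closing > best_closing:
            best_name, best_closing = name, closing
    return best_name


# Hypothetical positions sampled after the box has been placed.
hand = [(100, 200), (120, 180), (140, 160), (160, 140)]
objects = {"cabinet_handle": (180, 120), "food_box": (90, 210)}
next_target = approached_target(hand, objects)
```

With these illustrative numbers the hand moves away from the food box and steadily toward the cabinet handle, so the sketch picks the handle. That matches the reasoning that the motion continues into a handle-reaching gesture rather than a pickup.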
