Use this skill when the user wants questions about what someone did, what happens next, which way something moved, whether one movement is faster or slower, or when two clips look almost the same unless you watch the motion carefully. Trigger it for requests like 'make questions about tiny movement differences', 'ask what happened right after', 'which way did it go', 'who moved faster', or 'make the answer depend on the action sequence rather than the still image.'
[Case 1]
To synthesize data for this capability, you must strictly follow a 3-phase pipeline. Do not hallucinate steps. Read the corresponding reference file for each phase sequentially:
Phase 1: Environment Exploration
Read the exploration guidelines to discover raw knowledge seeds:
references/EXPLORATION.md
Phase 2: Trajectory Selection
Once Phase 1 is complete, read the selection criteria to evaluate the trajectory:
references/SELECTION.md
Phase 3: Data Synthesis
Once a trajectory passes Phase 2, read the synthesis instructions to generate the final data:
references/SYNTHESIS.md