Advanced Visual Intelligence for AEC (FastVLM + ControlNet)
This skill equips Thom (Studio Agent) with "Eyes" to understand architectural visual data and "Hands" to guide generation.
Semantic Plan Reading (FastVLM)
Visual Mood Analysis (FastVLM)
scene_brief.yaml fragment.Visual Control (ControlNet)
Because these models are heavy, this skill acts as a Connector to either:
MLX (Apple Silicon optimized) versions of FastVLM.# Analyze a plan for room types
studio analyze plan projects/demo/plan_clean_001.png --mode semantics
# Extract mood from a reference
studio analyze mood projects/demo/ref_living_room.jpg
# Generate with ControlNet constraint
studio render interior --plan projects/demo/plan_clean_001.png --control canny