Use this skill to initialize the ML pipeline environment. Load and validate all data files, inspect shapes and types, confirm required columns exist, and produce a structured data report for downstream agents. Trigger this skill at the start of any ML task before designing or coding models.