Design and implement repeatable preprocessing pipelines for cleaning, encoding, transforming, and validating ML input data. In governed ML routing this skill is a stage assistant: it helps on preprocessing-heavy steps after the main route owner is chosen, and should not take over the whole ML workflow by itself.
In governed ML routing, treat this skill as a stage assistant. It is for preprocessing-heavy execution after the pack owner is chosen.
Use this skill when:
scikit-learn, ml-pipeline-workflow, or training-machine-learning-modelsml-data-leakage-guardscientific-data-preprocessingml-data-leakage-guard before trusting fitted preprocessing stepssplitting-datasets when the next narrow problem is partition strategy