Use when implementing or updating data loading so every dataset source, transform, split, and filtering decision is documented and reproducible.
src, or pipeline workflows.data_lineage.md + quick stats).src, mirror traceability notes in module docs.source_pathretrieval_datedataset_version_or_hashfilters_appliedtransform_sequencesplit_strategyseedoutput_counts