Expand repos from orgs in data_classify.csv using GitHub API + scoring (top 20 per org)
Starting from output/data_classify.csv, expand organizations into their top repos and merge with the original repos.
Input: output/data_classify.csv — must contain an entity_type column (repo / organization / unknown) and an 上游地址 (upstream URL) column.
Output: output/repos.csv — merged list of the original repos plus the top-N repos per org (deduplicated; originals win on collision).

Usage:

    python3 scripts/expand_repos.py output/data_classify.csv \
        -o output/repos.csv --top 20 --summary
Requires GITHUB_TOKEN env var.
For each repo under a GitHub org, the script fetches stars, forks, and pushed_at via /orgs/{login}/repos (falling back to /users/{login}/repos) and scores each repo:
score = 2 * log10(stars + 1)
+ 1 * log10(forks + 1)
+ 3 * exp(-days_since_last_push / 180)
Forks and archived repos are excluded, and the top --top (default 20) repos per org are kept.
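As a reference, a minimal Python sketch of the scoring and selection step. It assumes the standard GitHub API field names (stargazers_count, forks_count, fork, archived, pushed_at); score_repo and top_repos are illustrative names, not the script's actual internals:

```python
import math
from datetime import datetime, timezone

def score_repo(repo: dict) -> float:
    """Apply the formula above to one repo dict from the GitHub API."""
    pushed = datetime.fromisoformat(repo["pushed_at"].replace("Z", "+00:00"))
    days_since_push = (datetime.now(timezone.utc) - pushed).days
    return (
        2 * math.log10(repo["stargazers_count"] + 1)
        + 1 * math.log10(repo["forks_count"] + 1)
        + 3 * math.exp(-days_since_push / 180)
    )

def top_repos(repos: list[dict], top: int = 20) -> list[dict]:
    """Drop forks and archived repos, then keep the top-N by score."""
    candidates = [r for r in repos if not r["fork"] and not r["archived"]]
    return sorted(candidates, key=score_repo, reverse=True)[:top]
```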
entity_type=repo rows from data_classify.csv are emitted first with source=repo; expanded org repos follow with source=org_expansion and their score, stars, forks, pushed_at, and expanded_from_org.

Output columns: 页签 (tab), 序号 (index), 项目名称 (project name), 分类 (category), 上游地址 (upstream URL), entity_type, source, expanded_from_org, stars, forks, pushed_at, score, language, description, reason.
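The originals-win merge can be as simple as the sketch below, assuming deduplication is keyed on the repo URL column (上游地址); the URL normalization shown is an assumption:

```python
def merge_rows(originals: list[dict], expanded: list[dict]) -> list[dict]:
    """Originals first; an expanded repo is dropped if its URL is already present."""
    def key(row: dict) -> str:
        # Assumed normalization: trailing slash and case don't distinguish repos.
        return row["上游地址"].rstrip("/").lower()

    seen = {key(r) for r in originals}
    merged = list(originals)
    for row in expanded:
        if key(row) not in seen:
            seen.add(key(row))
            merged.append(row)
    return merged
```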
Org repo listings are cached in output/.cache/org_repos_expansion_cache.json. Use --no-cache to force refresh.
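A sketch of the listing-plus-cache path, assuming requests, per_page=100 pagination, and a cache laid out as a JSON object keyed by login (the endpoints and cache file are from the description above; everything else is illustrative):

```python
import json
import os
from pathlib import Path

import requests

CACHE_PATH = Path("output/.cache/org_repos_expansion_cache.json")

def list_org_repos(login: str, use_cache: bool = True) -> list[dict]:
    """List all repos for a login, with pagination, user fallback, and a file cache."""
    cache = json.loads(CACHE_PATH.read_text()) if use_cache and CACHE_PATH.exists() else {}
    if login in cache:
        return cache[login]

    headers = {"Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}"}
    repos: list[dict] = []
    for endpoint in (f"/orgs/{login}/repos", f"/users/{login}/repos"):
        page, not_an_org = 1, False
        while True:
            resp = requests.get(
                f"https://api.github.com{endpoint}",
                headers=headers,
                params={"per_page": 100, "page": page},
                timeout=30,
            )
            if resp.status_code == 404:
                not_an_org = True  # /orgs 404s for user accounts; try /users next
                break
            resp.raise_for_status()
            batch = resp.json()
            repos.extend(batch)
            if len(batch) < 100:  # short page means we reached the end
                break
            page += 1
        if not not_an_org:
            break  # this endpoint worked; no fallback needed

    cache[login] = repos
    CACHE_PATH.parent.mkdir(parents=True, exist_ok=True)
    CACHE_PATH.write_text(json.dumps(cache))
    return repos
```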
→ /trace-foundations and /trace-companies (parallel)