Extract and score leads from GitHub repositories by analyzing stars, forks, issues, PRs, comments, and contributions. Produces unified multi-repo CSV with deduplicated user profiles. No paid API credits required.
Extract high-intent leads from one or more GitHub repositories by analyzing every type of user interaction. This skill uses only free GitHub API data — no enrichment credits are spent.
Note: If the user describes their ICP as GitHub-active but hasn't identified specific repositories yet, this skill still applies. In that case, ask the user which repositories their ICP is likely to interact with, or help them identify relevant repos based on the technology/space they describe.
gh CLI authenticated (gh auth status to verify)PyYAML installedBefore running, ask the user for:
owner/repo stringsgh auth status
python3 ${CLAUDE_SKILL_DIR}/scripts/gh_repo_signals.py \
--repos "owner1/repo1,owner2/repo2" \
--limit <USER_LIMIT> \
--output ${CLAUDE_SKILL_DIR}/../.tmp/repo_signals.csv
Replace the repos and limit with user-provided values.
The tool will:
_users.csv and _interactions.csvThe tool produces two CSV files:
repo_signals_users.csv — One row per person, deduplicated across all repos
| Column | Description |
|---|---|
| username | GitHub login |
| name | Display name |
| Public GitHub email | |
| commit_email | Email from git commits (if different from public) |
| company | Company from GitHub profile |
| location | Location from GitHub profile |
| blog | Website/blog URL |
| Twitter/X handle | |
| bio | GitHub bio |
| followers | Follower count |
| public_repos | Number of public repos |
| total_repos_interacted | Number of input repos this user interacted with |
| interaction_score | Weighted score across all repos |
repo_signals_interactions.csv — One row per user x repo combination
| Column | Description |
|---|---|
| username | GitHub login |
| repository | Which repo this row is about |
| is_contributor | YES/NO |
| is_stargazer | YES/NO |
| is_forker | YES/NO |
| is_watcher | YES/NO |
| is_issue_opener | YES/NO |
| is_pr_author | YES/NO |
| is_issue_commenter | YES/NO |
| contribution_count | Number of commits (0 if not contributor) |
| starred_at | Date starred (if applicable) |
| forked_at | Date forked (if applicable) |
| repo_score | Interaction score for this specific repo |
Once the CSV files are generated, do not stop. Immediately proceed to analyze the data and brief the user.
Check if you already know the user's company and intent from prior conversation. If not, ask:
"Before I analyze these results, I need to understand who you're finding leads for:
- What does your company/product do? (one-liner is fine)
- Who is your ideal customer? (role, company size, industry, tech stack — whatever is relevant)
- What's the goal for these leads? (outbound sales, partnership, hiring, community building, etc.)"
Do NOT proceed to analysis until you have this context. It directly shapes the recommendations.
Read the generated .csv file and compute the following analysis. Present it to the user as a structured briefing.
6a. Overall Stats
6b. Multi-Repo Users (if multiple repos were scanned)
6c. Top Companies
6d. Interaction Patterns
6e. Data Gaps
Based on the analysis AND the user's company context/intent, recommend specific next steps. Tailor recommendations to what the data actually shows — do not give generic advice.
Framework for recommendations:
If multi-repo users exist (2+ repos):
If company clusters exist (3+ users from same company):
/enrich-company/enrich-lead to find the decision-maker at those companies (not the developer who starred — the person who signs off on purchases)If high email coverage (>40%):
/qa-agent to qualify them against ICP before reaching outIf low email coverage (<40%):
/find-email for the top-scored users firstIf the user's goal is outbound sales:
If the user's goal is community/partnerships:
Always include a cost estimate:
Format the recommendation as a clear action plan with numbered steps, estimated costs, and expected outcomes.
After presenting the analysis and recommendations, ask:
"Would you like me to proceed with any of these steps? I can start with [recommended first action] — it would cost approximately [estimate] and take [time estimate]."
Wait for user confirmation before spending any credits or running enrichment tools.