Interprets product data and metrics in context, classifying analysis type and producing calibrated insights. Use when asked to "analyze this data", "what does this metric mean", "interpret these numbers", "help me understand this trend", or "analyze this funnel drop-off".
Answer a data question in product context. The skill's job is to turn a product question and data into an insight — not to restate numbers, but to interpret what they mean and what to do about them. The analysis type adapts to the question: metric interpretation, funnel analysis, cohort comparison, or anomaly investigation.
For numeric analyses, the output is not final until it has been bundled and replay-verified. This skill now requires a reproducibility bundle (inputs, derived tables, calculation log, saved code, charts, manifest) and a replay pass using .claude/skills/data-analysis/analysis_runner.py.
The input can be any combination of a question and data. It does not need to be clean or complete; this skill works with what's provided and names what's missing.
The most common failure in data analysis is answering the wrong question precisely. Confirm the actual question and what decision it informs before analyzing.
Rich input (specific question stated, decision context clear, data provided): Restate the question and proceed. Example: "Question: why did [metric] drop [X%] last [period]. This informs whether to [decision]. Analyzing the provided data now."
Moderate input (data provided but question vague, or clear question but no decision context): Ask 1-2 targeted questions, e.g., "What decision will this analysis inform?" or "Which metric movement are you most concerned about?"
Thin input (data dump with no question, or a vague "look at this"): Present a structured interpretation:
Here's what I think the question is — correct me:
- Question: [Inferred from the data and context — e.g., "Why did activation drop 15% week-over-week?"]
- Analysis type: [Metric interpretation / Funnel analysis / Cohort analysis / Anomaly investigation]
- Decision this informs: [Best inference — e.g., "Whether to investigate further or treat as normal variance"]
Is that the right question? I want to make sure I'm answering the thing you actually need to know.
Understand the question being asked and what data is available. Separate the question (what the PM wants to know) from the data (what evidence exists). If the question is implicit in the data ("here's our funnel, what's going on?"), make the question explicit before analyzing.
Read these files:
- references/data-interpretation.md — metric interpretation heuristics, funnel analysis standards, cohort patterns, anomaly investigation framework, correlation vs. causation guardrails, data quality flags, confidence levels
- references/pm-smell-test.md — check for smells 2 (no way to measure success) and 5 (false precision)
- references/agent-readable-output.md — Agent Block format and shared enum vocabulary
- references/visualization-standards.md — chart selection by analysis type, insight-first titling, annotation standards, comparison anchors, label discipline, style rules, technical save pattern

If company/facts/product.md exists and is substantive, read it for product areas and feature landscape. This helps interpret metrics in context (e.g., a drop in "activation" means something different depending on what activation means for this product).
If company/facts/customers.md exists and is substantive, read it for customer segment definitions. This helps frame cohort analysis using actual segment definitions rather than inferred ones.
If company/interfaces/data-sources.md exists and is substantive, read it for data infrastructure context — what tools track what, known limitations, key dashboards.
If any of these files exist but are still stub templates, treat them as unavailable and say so in the context note.
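The stub check above can be sketched as a small helper. This is an illustration, not part of the skill spec: the marker strings and the minimum length threshold are assumptions about what a stub template looks like.

```python
from pathlib import Path

# Hypothetical stub-detection helper: a company file counts as unavailable
# when it is missing, still contains template placeholder markers, or has
# almost no real content. Both the markers and the 200-char floor are
# assumptions for this sketch.
STUB_MARKERS = ("[TODO", "<!-- stub", "{{", "Fill in")

def is_substantive(path, min_chars=200):
    p = Path(path)
    if not p.exists():
        return False
    text = p.read_text(encoding="utf-8")
    if any(marker in text for marker in STUB_MARKERS):
        return False
    return len(text.strip()) >= min_chars
```

Files that fail this check get named in the context note rather than silently skipped.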
Determine the analysis type from the question and data:
| Type | Triggered by |
|---|---|
| Metric interpretation | "What does this number mean?" "Why did X change?" "Is this good or bad?" |
| Funnel analysis | Funnel data provided, or question about conversion/drop-off |
| Cohort analysis | Comparison between user groups, or question about how different segments behave |
| Anomaly investigation | Something unexpected happened — a spike, drop, or unusual pattern |
State the classification at the top of the output. If the question spans multiple types (e.g., "our funnel conversion dropped — investigate the anomaly"), name the primary type and note the secondary.
If data is provided, assess whether it's sufficient to answer the question: is the sample large enough, does the time window cover the relevant period, and is there a baseline to compare against?
If only a question is provided without data, describe what data would be needed to answer it and where it would likely come from.
When numeric data is provided, run calculations programmatically. Do not estimate arithmetic. For funnel math, growth rate deltas, percentage changes, cohort comparisons, and statistical computations — execute the calculation in code (Python) and show the result. Estimation introduces exactly the false precision that Smell 5 flags. If the data is too messy to compute directly, name what's wrong with it before approximating.
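A minimal sketch of the computed-not-estimated rule. The stage counts and baseline below are invented for the example; only the arithmetic pattern carries over.

```python
# Illustrative funnel counts; in a real run these come from inputs/.
stages = {"visited": 12480, "signed_up": 3120, "activated": 1810}

def step_conversion(counts):
    """Conversion rate from each stage to the next, as a fraction."""
    names = list(counts)
    return {f"{a}->{b}": counts[b] / counts[a]
            for a, b in zip(names, names[1:])}

def pct_change(current, baseline):
    """Signed percentage change vs. baseline."""
    return (current - baseline) / baseline * 100

print(step_conversion(stages))          # visited->signed_up = 0.25
print(round(pct_change(1810, 2130), 1))  # -15.0 (vs. a hypothetical prior week)
```

The point is that every number quoted in the report was produced by an executed expression, not eyeballed from the raw data.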
Every calculation that supports a key numeric claim must be logged with a stable calc_id. In the markdown report, cite the supporting calculation inline using the format [calc:your-calc-id]. Numeric analyses without calc citations are incomplete.
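One way to write such a log entry. The JSONL field names match those required for calc-log.jsonl, but the log_calc helper, the calc_id, and the values are hypothetical.

```python
import json

# Hypothetical helper: append one calculation record per line (JSON Lines).
def log_calc(log_path, calc_id, label, formula, inputs, result, units,
             source_artifacts=(), derived_artifacts=()):
    entry = {
        "calc_id": calc_id,
        "label": label,
        "formula": formula,
        "inputs": inputs,
        "result": result,
        "units": units,
        "source_artifacts": list(source_artifacts),
        "derived_artifacts": list(derived_artifacts),
    }
    with open(log_path, "a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")
    return entry

# Cited in the report as [calc:activation-wow-delta]; all values illustrative.
log_calc("calc-log.jsonl", "activation-wow-delta",
         "Week-over-week activation change",
         "(1810 - 2130) / 2130 * 100",
         {"current": 1810, "baseline": 2130},
         -15.02, "percent",
         source_artifacts=["inputs/events.csv"])
```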
Run the analysis appropriate to the type. Follow the standards in references/data-interpretation.md:
- For metric interpretation: establish context (baseline, variance, seasonality), characterize the movement, and weigh candidate explanations.
- For funnel analysis: lay out the stage-by-stage data, quantify drop-offs, and segment the biggest drop-off.
- For cohort analysis: define the cohorts, build the comparison table, and analyze where and why the cohorts diverge.
- For anomaly investigation: characterize the anomaly (timing, magnitude, scope, shape), then generate and rank hypotheses.
When numeric data is provided, always produce 1–3 charts as PNG files. The chart is not decoration — it should make the key finding visually obvious without requiring the reader to parse the prose. Run programmatically in Python (matplotlib). Do not sketch or estimate.
Load and apply references/visualization-standards.md for all chart decisions: type selection, titling, annotation, comparison anchors, label placement, color, and save pattern.
Before writing any chart code: Identify which narrative role each chart will play — Context (establish the baseline or landscape), Tension (reveal the finding), Resolution (explain why or what to do). State each role in a code comment at the top of that chart's block. The order of charts in the Visualizations section must follow Context → Tension → Resolution.
Chart type by analysis type:
| Analysis type | Primary chart | Secondary (if a second finding warrants it) |
|---|---|---|
| Metric interpretation | Line/time-series with the key movement annotated | Bar showing magnitude vs. baseline |
| Funnel analysis | Horizontal waterfall — absolute users at each stage | Segmented bar at the biggest drop-off step |
| Cohort analysis | Grouped bar (cohorts × metric, same time window) | Dot plot or heatmap for retention over time |
| Anomaly investigation | Time series with the anomaly window highlighted | Breakdown bar showing which segment drives the scope |
Save pattern and naming convention: see references/visualization-standards.md.
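A sketch of the chart step, assuming matplotlib is available. The data, baseline, title, and filename are illustrative; the real save pattern and style rules come from references/visualization-standards.md.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend: save to file, no display
import matplotlib.pyplot as plt

# Narrative role: Tension (reveal the finding: activation drop vs. baseline).
weeks = ["W1", "W2", "W3", "W4"]           # illustrative data
activated = [2095, 2130, 2110, 1810]

fig, ax = plt.subplots(figsize=(8, 4.5))
x = range(len(weeks))
ax.plot(list(x), activated, marker="o")
ax.set_xticks(list(x))
ax.set_xticklabels(weeks)
ax.axhline(2112, linestyle="--", linewidth=1, label="3-week baseline")  # comparison anchor
ax.annotate("-15% WoW", xy=(3, 1810), xytext=(2.0, 1900),
            arrowprops=dict(arrowstyle="->"))
ax.set_title("Activation fell 15% in W4 after three flat weeks")  # insight-first title
ax.set_ylabel("Activated users / week")
ax.legend(frameon=False)
fig.tight_layout()
fig.savefig("chart.png", dpi=150)
```

Note the title states the finding, the annotation marks the key movement, and a dashed reference line gives the reader a comparison anchor, per the standards above.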
For every numeric analysis, save the computational basis alongside the prose:
- report.md — the human-readable report
- analysis.py — the exact replayable code used to generate calculations, derived tables, and charts
- inputs/ — copies of every raw input file or normalized inline-input dump
- derived/ — every transformed table used in the analysis
- calc-log.jsonl — one JSON object per calculation with calc_id, label, formula, inputs, result, units, source_artifacts, and derived_artifacts
- manifest.yaml — run metadata, file inventory, dependencies, rerun command, and verification status

analysis.py must support this replay interface:
- `--input-dir`
- `--derived-dir`
- `--chart-dir`
- `--calc-log`

The same values are also passed via environment variables during verification:
- `PM_AGENT_INPUT_DIR`
- `PM_AGENT_DERIVED_DIR`
- `PM_AGENT_CHART_DIR`
- `PM_AGENT_CALC_LOG`

For numeric analyses, do not treat the work as final until replay verification passes.
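The replay interface might be parsed like this inside analysis.py. Treating the environment variables as defaults that explicit flags override is an assumption for this sketch; the spec above only says both channels carry the same values.

```python
import argparse
import os

# Sketch: CLI flags with environment-variable fallbacks, matching the
# flag and variable names listed above. Precedence (flag over env var)
# is an assumption, not specified by the skill.
def parse_run_dirs(argv=None):
    parser = argparse.ArgumentParser(description="Replayable analysis entry point")
    parser.add_argument("--input-dir",
                        default=os.environ.get("PM_AGENT_INPUT_DIR", "inputs"))
    parser.add_argument("--derived-dir",
                        default=os.environ.get("PM_AGENT_DERIVED_DIR", "derived"))
    parser.add_argument("--chart-dir",
                        default=os.environ.get("PM_AGENT_CHART_DIR", "."))
    parser.add_argument("--calc-log",
                        default=os.environ.get("PM_AGENT_CALC_LOG", "calc-log.jsonl"))
    return parser.parse_args(argv)
```

With this shape, the verifier can rerun `python3 analysis.py` with either flags or environment variables and get identical paths.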
Use .claude/skills/data-analysis/analysis_runner.py:
python3 .claude/skills/data-analysis/analysis_runner.py finalize --spec /path/to/spec.json

The run is verified when:
- verification.json exists
- manifest.yaml has verification_status: Passed
- the replay outputs are present under replay/

If replay verification fails:
Check for: a mismatch between logged and replayed results, missing files in inputs/ or derived/, nondeterministic code (unseeded randomness, timestamps), and hardcoded absolute paths.
Name every meaningful limitation of the analysis:
This is not a formality. An analysis without stated limitations is an analysis the PM can't properly evaluate.
Based on the analysis, recommend what to do next:
Sort the Hypotheses table by Rank (1 = most likely) before output. If the data doesn't clearly distinguish likelihood between hypotheses, state the tied ranks and explain in the Limitations section rather than forcing an ordering.
Populate the Agent Block:
- analysis_type: from the classification in step 4
- finding: the key finding in one sentence (the first sentence of the Key Finding section)
- confidence: the categorical confidence level from the Key Finding section
- top_hypothesis: the row number of the Rank 1 hypothesis
- recommended_action: one of Pull more data / Run experiment / Act on finding / Monitor — the primary next step from step 9
- sample_size_adequate: Yes if sample size was assessed as sufficient in step 5; No if a size constraint was identified; Unknown if size wasn't determinable
- run_dir: the run folder under knowledge/data-analyses/
- report: workspace-relative path to report.md
- code_artifact: workspace-relative path to analysis.py
- calc_log_artifact: workspace-relative path to calc-log.jsonl
- source_artifacts: workspace-relative paths to files in inputs/
- derived_artifacts: workspace-relative paths to files in derived/
- manifest_artifact: workspace-relative path to manifest.yaml
- verification_artifact: workspace-relative path to verification.json
- verification_status: Passed / Failed / Not Required

## Data Analysis: [Question or Topic]
**Analysis type:** [Metric interpretation / Funnel analysis / Cohort analysis / Anomaly investigation]
<!-- AGENT BLOCK -->
```yaml
agent_block:
  skill: data-analysis
  analysis_type: [Metric interpretation / Funnel analysis / Cohort analysis / Anomaly investigation]
  finding: "[One sentence summary of the key finding]"
  confidence: [High / Medium / Low]
  top_hypothesis: [integer — rank 1 hypothesis number]
  recommended_action: [Pull more data / Run experiment / Act on finding / Monitor]
  sample_size_adequate: [Yes / No / Unknown]
  run_dir: knowledge/data-analyses/YYYY-MM-DD-analysis-slug
  report: knowledge/data-analyses/YYYY-MM-DD-analysis-slug/report.md
  code_artifact: knowledge/data-analyses/YYYY-MM-DD-analysis-slug/analysis.py
  calc_log_artifact: knowledge/data-analyses/YYYY-MM-DD-analysis-slug/calc-log.jsonl
  source_artifacts:
    - knowledge/data-analyses/YYYY-MM-DD-analysis-slug/inputs/source_01.csv
  derived_artifacts:
    - knowledge/data-analyses/YYYY-MM-DD-analysis-slug/derived/table_01.csv
  manifest_artifact: knowledge/data-analyses/YYYY-MM-DD-analysis-slug/manifest.yaml
  verification_artifact: knowledge/data-analyses/YYYY-MM-DD-analysis-slug/verification.json
  verification_status: [Passed / Failed / Not Required]
  charts:
    - knowledge/data-analyses/YYYY-MM-DD-analysis-slug/chart.png
```
<!-- /AGENT BLOCK -->
---
### Question
[Restate the question clearly. If the question was implicit, make it explicit.]
---
### Key Finding
**Finding:** [1-2 sentences. The answer, stated directly. Cite key numeric claims with `[calc:calc_id]`.]
**Confidence:** [High / Medium / Low] — [One sentence on what drives this confidence level — sample size, data quality, or evidence strength]
**Top Hypothesis:** Hypothesis #[N] — [brief label from the Hypotheses table below]
**Recommended Action:** [Pull more data / Run experiment / Act on finding / Monitor]
---
### Analysis
[Structured analysis appropriate to the type.
For metric interpretation: context (baseline, variance, seasonality), the movement, candidate explanations.
For funnel analysis: stage-by-stage data, drop-off analysis, segmentation findings.
For cohort analysis: cohort definitions, comparison table, divergence analysis.
For anomaly investigation: characterization (timing, magnitude, scope, shape), hypotheses.
Show the work. Include tables, calculations, and comparisons as appropriate.]
---
### Visualizations

![Chart 1](chart.png)

*Chart 1: [One sentence — what the chart shows and what the reader should conclude. Cite the reference line or comparison anchor used.]*

![Chart 2](chart_2.png)

*Chart 2: [Caption.] — Omit this entry if only one chart was produced.*
---
### Hypotheses
| Rank | # | Hypothesis | Evidence For | Evidence Against | Likelihood |
|------|---|-----------|-------------|-----------------|------------|
| 1 | [#] | [Most likely hypothesis] | [What supports it] | [What contradicts it] | [High / Medium / Low] |
| 2 | [#] | [Second most likely] | [Evidence for] | [Evidence against] | [Likelihood] |
---
### Limitations
- [Named limitation with explanation of how it affects the analysis]
- [Named limitation]
---
### Recommended Next Steps
1. [What to do next — additional data, experiment, action, or monitoring]
2. [Second recommendation]
---
### Reproducibility
- **Verification:** [Passed / Failed / Not Required]
- **Runner command:** `python3 .claude/skills/data-analysis/analysis_runner.py verify --run-dir knowledge/data-analyses/YYYY-MM-DD-analysis-slug`
- **Important calc IDs:** [List the critical `calc_id`s cited in the report]
- **Bundle contents:** `report.md`, `analysis.py`, `inputs/`, `derived/`, `calc-log.jsonl`, `manifest.yaml`, `verification.json`, charts, and `replay/`
---
### Smell Test
- **Smell 2 (No Way to Measure):** [Can the available data actually answer the question? Finding or "Clear — data is sufficient for the question asked"]
- **Smell 5 (False Precision):** [Is confidence calibrated to sample size and data quality? Finding or "Clear — confidence levels match the evidence"]
> **Context note:** [State which substantive company files were loaded, which were absent, and which were stub templates. Note what the analysis might miss without product context or data source context.]
Create a run folder in knowledge/data-analyses/ using the naming convention: YYYY-MM-DD-analysis-slug/, where YYYY-MM-DD is today's date and analysis-slug is a lowercase hyphenated slug derived from the question or topic.
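The naming convention can be sketched as follows; the exact slug-trimming rules (character class, 60-char cap) are assumptions consistent with "lowercase hyphenated slug", not prescribed above.

```python
import datetime
import re
from pathlib import Path

# Sketch: derive the run folder name from today's date and the question.
def run_dir_for(question, base="knowledge/data-analyses"):
    slug = re.sub(r"[^a-z0-9]+", "-", question.lower()).strip("-")[:60]
    today = datetime.date.today().isoformat()
    return Path(base) / f"{today}-{slug}"

print(run_dir_for("Why did activation drop 15% week-over-week?"))
```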
Inside that folder, save:
- report.md
- analysis.py
- calc-log.jsonl
- manifest.yaml
- verification.json
- inputs/
- derived/
- replay/
- chart.png, plus chart_2.png / chart_3.png when additional charts exist

For numeric analyses, finalize the run with:
python3 .claude/skills/data-analysis/analysis_runner.py finalize --spec /path/to/spec.json
Report the run directory, verification status, and all saved artifact paths in the conversation.