Overview

This skill provides comprehensive capabilities for data analysis workflows on CSV datasets. It automatically analyzes missing value patterns, intelligently imputes missing data using appropriate statistical methods, and creates interactive Plotly Dash dashboards for visualizing trends and patterns. The skill combines automated missing value handling with rich interactive visualizations to support end-to-end exploratory data analysis.

Core Capabilities

The data-analyst skill provides three main capabilities that can be used independently or as a complete workflow:

1. Missing Value Analysis

Automatically detect and analyze missing values in datasets, identifying patterns and suggesting optimal imputation strategies.

2. Intelligent Imputation

Apply sophisticated imputation methods tailored to each column's data type and distribution characteristics.

3. Interactive Dashboard Creation

Generate comprehensive Plotly Dash dashboards with multiple visualization types for trend analysis and exploration.

Overview

Core Capabilities

The data-analyst skill provides three main capabilities that can be used independently or as a complete workflow:

1. Missing Value Analysis

Automatically detect and analyze missing values in datasets, identifying patterns and suggesting optimal imputation strategies.

2. Intelligent Imputation

Apply sophisticated imputation methods tailored to each column's data type and distribution characteristics.

3. Interactive Dashboard Creation

Generate comprehensive Plotly Dash dashboards with multiple visualization types for trend analysis and exploration.

Data Analyst

Overview

Core Capabilities

1. Missing Value Analysis

2. Intelligent Imputation

3. Interactive Dashboard Creation

Data Analyst

Overview

Core Capabilities

1. Missing Value Analysis

2. Intelligent Imputation

3. Interactive Dashboard Creation

Complete Workflow

Step 1: Analyze Missing Values

Step 2: Impute Missing Values

Step 3: Create Interactive Dashboard

Individual Use Cases

Use Case A: Quick Missing Value Assessment

Use Case B: Imputation Only

Use Case C: Visualization Only

Use Case D: Custom Imputation Strategy

Understanding Imputation Methods

Dashboard Features

Summary Statistics

Time Series Analysis

Distribution Analysis

Correlation Analysis

Categorical Analysis

Scatter Plot Matrix

Setup and Dependencies

Best Practices

For Analysis:

For Imputation:

For Dashboards:

Handling Edge Cases

High Missing Rates (>50%)

Mixed Data Types

Small Datasets

Time Series Gaps

Troubleshooting

Script fails with "module not found"

Dashboard won't start (port in use)

KNN imputation is slow

Imputed values seem incorrect

Resources

scripts/

references/

Other Files

Data Analyst

Project Planner

Market Sizing Analysis

KPI Dashboard Design

Data Storytelling

Apple Notes