This skill covers applied microeconomic empirical methods and research design. Use when the user is selecting an identification strategy, comparing estimators, running diagnostics, designing a research study, or evaluating an empirical strategy. Triggers on "which method", "what estimator", "how to choose", "method comparison", "empirical strategy", "research design", "applied micro", "identification strategy", "power analysis", "design-based", "model-based", "minimum detectable effect", "specification".
Reference for applied micro research design: method selection, diagnostics, inference, pitfalls, reporting standards, and power analysis.
Use when the user is selecting an identification strategy, comparing estimators, running diagnostics, designing a research study, or evaluating an empirical strategy (for FRED/World Bank API access, see references/data-sources.md).

Skip when the task is better served by a more specific skill: causal-inference (implementing IV, DiD, RDD, SC, matching), structural-modeling, submission-guide, identification-proofs, or bayesian-estimation.

After selecting a method, the econometric-reviewer agent can review the implementation and the identification-critic agent can evaluate the identification argument.
Start with the fundamental question: What source of variation identifies the causal effect?
| Source of Variation | Method Family | Key Assumption |
|---|---|---|
| Randomized assignment (with full compliance) | Experimental analysis (OLS on treatment indicator) | Random assignment |
| Randomized assignment (with imperfect compliance) | IV / 2SLS using random assignment as instrument | Exclusion restriction, monotonicity |
| Policy change at a sharp threshold | Sharp RDD | Continuity of potential outcomes at cutoff |
| Policy change at a threshold with imperfect compliance | Fuzzy RDD (= IV at the cutoff) | Continuity + monotonicity at cutoff |
| Policy change at a point in time, with affected and unaffected groups | Difference-in-differences | Parallel trends |
| Staggered policy adoption across units over time | Staggered DiD (Callaway-Sant'Anna, Sun-Abraham, etc.) | Parallel trends (conditional on group and time) |
| Rare event affecting a single unit, long pre-treatment data | Synthetic control | Pre-treatment fit implies post-treatment counterfactual |
| Exogenous shifter of treatment that does not affect outcome directly | IV / 2SLS / GMM | Exclusion restriction, relevance, monotonicity |
| Rich set of observables that plausibly captures all confounders | Matching, IPW, AIPW (selection on observables) | Conditional independence (no unobserved confounders) |
| No credible exogenous variation | Sensitivity analysis, bounds, partial identification | Depends on bounding assumptions |
Within DiD:
```
Is treatment timing staggered?
├── No → Classic 2x2 DiD (TWFE is fine)
└── Yes
    ├── Can treatment turn off (reversals)?
    │   ├── Yes → de Chaisemartin-D'Haultfoeuille (2020)
    │   └── No
    │       ├── Do you have never-treated units?
    │       │   ├── Yes → Callaway-Sant'Anna (2021) with never-treated controls
    │       │   └── No → Callaway-Sant'Anna with not-yet-treated controls,
    │       │            or Sun-Abraham (2021)
    │       └── Are effects likely heterogeneous across cohorts?
    │           ├── Yes → Callaway-Sant'Anna or Sun-Abraham (NOT TWFE)
    │           └── No → TWFE is OK, but report Bacon decomposition
```
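The TWFE pitfall in the last branch can be seen in a tiny deterministic example (all numbers invented for illustration): two units, four periods, staggered adoption, and effects that grow with time since adoption. On a balanced panel the TWFE coefficient equals the two-way within estimator, so it can be computed by demeaning:

```python
import numpy as np

# Two units, four periods, no noise, zero unit/time fixed effects.
# Unit A adopts at t=2 with effects 1, 2, 3 (growing with time since adoption);
# unit B adopts at t=4 with effect 1.
Y = np.array([[0.0, 1.0, 2.0, 3.0],   # unit A outcomes
              [0.0, 0.0, 0.0, 1.0]])  # unit B outcomes
D = np.array([[0.0, 1.0, 1.0, 1.0],   # unit A treatment status
              [0.0, 0.0, 0.0, 1.0]])  # unit B treatment status

def two_way_demean(M):
    # Within transformation: subtract unit and time means, add back grand mean.
    return M - M.mean(1, keepdims=True) - M.mean(0, keepdims=True) + M.mean()

Yd, Dd = two_way_demean(Y), two_way_demean(D)
twfe = (Dd * Yd).sum() / (Dd ** 2).sum()  # TWFE coefficient on D
avg_effect = Y[D == 1].mean()             # average realized effect on the treated

print(f"TWFE: {twfe:.2f}, average effect on the treated: {avg_effect:.2f}")
```

TWFE returns 0.50 here even though every realized effect is at least 1 (average 1.75): unit B's change at t=4 is differenced against already-treated unit A, whose effect is still growing, so that comparison enters with a negative weight.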
Within IV:
```
How many instruments for how many endogenous regressors?
├── Exactly identified (K instruments = K endogenous)
│   └── 2SLS (= IV = Wald estimator for single instrument)
├── Over-identified (K instruments > K endogenous)
│   ├── 2SLS (default)
│   ├── GMM (efficient, use if heteroskedasticity suspected)
│   └── LIML (less biased with weak instruments)
└── Under-identified (K instruments < K endogenous)
    └── Cannot identify all parameters — need more instruments or fewer endogenous regressors
```
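For the exactly identified single-instrument case, the Wald estimator in the tree is just cov(Z, Y) / cov(Z, X). A minimal simulated sketch (all parameters invented) of it recovering a causal effect that OLS misses:

```python
import numpy as np

rng = np.random.default_rng(0)
n, beta = 100_000, 2.0                        # beta is the true causal effect

z = rng.binomial(1, 0.5, n)                   # instrument, e.g. random encouragement
u = rng.normal(size=n)                        # unobserved confounder
x = 0.5 * z + 0.8 * u + rng.normal(size=n)    # endogenous regressor (first stage pi = 0.5)
y = beta * x + u + rng.normal(size=n)

C = np.cov(x, y)
ols = C[0, 1] / C[0, 0]                       # biased: u moves both x and y
iv = np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]  # reduced form / first stage

print(f"OLS: {ols:.3f}  IV: {iv:.3f}  truth: {beta}")
```

OLS lands well above 2 because of the confounder; the Wald ratio is consistent as long as z affects y only through x (the exclusion restriction, which the simulation builds in by construction).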
Within RDD:
```
Does crossing the threshold guarantee treatment?
├── Yes → Sharp RDD
└── No → Fuzzy RDD
    └── Is the running variable continuous?
        ├── Yes → Standard rdrobust
        └── No (discrete / few mass points)
            └── Cattaneo-Idrobo-Titiunik (2019) discrete RD methods
```
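A sketch of what sharp RD estimation does under the hood, on simulated data with an invented jump: local linear fits on each side of the cutoff within a bandwidth, differenced at the threshold. This uses a uniform kernel and a hand-picked bandwidth; rdrobust adds kernel weighting, data-driven bandwidth selection, and bias-corrected inference.

```python
import numpy as np

rng = np.random.default_rng(1)
n, tau, h = 20_000, 0.7, 0.25          # sample size, true jump, bandwidth

x = rng.uniform(-1, 1, n)              # running variable, cutoff at 0
d = (x >= 0).astype(float)             # sharp RD: crossing guarantees treatment
y = 1.0 + 0.5 * x + tau * d + rng.normal(scale=0.2, size=n)

def side_intercept(mask):
    # Local linear fit within the bandwidth; return the fitted value at x = 0.
    slope, intercept = np.polyfit(x[mask], y[mask], 1)
    return intercept

left = side_intercept((x < 0) & (x > -h))
right = side_intercept((x >= 0) & (x < h))
rd_estimate = right - left
print(f"RD estimate at cutoff: {rd_estimate:.3f} (truth {tau})")
```

Note the estimate uses only observations inside the bandwidth, which is why the power section below insists on effective N rather than total N.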
Within Matching / Selection on Observables:
```
Is the selection-on-observables assumption plausible?
├── No → Need a different identification strategy
└── Yes
    ├── Do you need ATE or ATT?
    │   ├── ATE → IPW or AIPW
    │   └── ATT → Matching or IPW with ATT weights
    ├── Is the propensity score model well-specified?
    │   ├── Uncertain → Use AIPW (doubly robust)
    │   └── Confident → IPW or regression adjustment
    └── Many covariates or nonlinear confounding?
        ├── Yes → ML-based methods (causal forests, DML)
        └── No → Parametric PS model + AIPW
```
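The IPW branch can be sketched with simulated data (all parameters invented). For transparency the true propensity score is plugged in directly; in practice you would estimate it, and prefer AIPW when unsure of the model:

```python
import numpy as np

rng = np.random.default_rng(2)
n, tau = 200_000, 1.0                            # tau is the true ATE

x = rng.normal(size=n)                           # observed confounder
e = 1 / (1 + np.exp(-x))                         # true propensity score e(x)
d = rng.binomial(1, e)                           # treatment, confounded by x
y = tau * d + 2.0 * x + rng.normal(size=n)

naive = y[d == 1].mean() - y[d == 0].mean()      # biased: treated units have higher x
# Horvitz-Thompson IPW estimator of the ATE
ipw_ate = np.mean(d * y / e) - np.mean((1 - d) * y / (1 - e))
print(f"naive: {naive:.2f}  IPW ATE: {ipw_ate:.2f}  truth: {tau}")
```

The naive difference in means is far above the truth; reweighting by the inverse propensity score removes the confounding, but only because conditional independence holds here by construction.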
Key diagnostics to run for each method family. For full reporting checklists and minimum standards, see references/reporting-standards.md.
| Method | Must-Run Diagnostics | Key Concern |
|---|---|---|
| IV / 2SLS | First-stage F (Kleibergen-Paap), reduced form, overidentification test | Weak instruments (F < 10), exclusion restriction |
| DiD (classic) | Pre-trend F-test, event study plot, raw means by group/period | Parallel trends violation |
| Staggered DiD | Bacon decomposition, Callaway-Sant'Anna group-time ATTs | Negative TWFE weights with heterogeneous effects |
| RDD | McCrary density test, covariate balance at cutoff, bandwidth sensitivity | Manipulation of running variable, extrapolation bias |
| Synthetic Control | Pre-fit RMSPE, permutation p-value, leave-one-out | Pre-period fit quality, donor pool sensitivity |
| Matching / AIPW | Overlap plots, Love plot (SMD before/after), Oster/Rosenbaum bounds | Lack of overlap, unobserved confounders |
| Structural | Convergence, identification rank condition, robustness to starting values | Global vs local optimum, identification failure |
For implementation details and diagnostic code by method, see the causal-inference skill.
| Mistake | Consequence | Fix |
|---|---|---|
| Clustering too fine (individual when treatment is at state level) | SEs too small; over-rejection | Cluster at the level of treatment assignment |
| Few clusters (< 30–40) with standard cluster-robust SEs | Poor finite-sample properties | Wild cluster bootstrap |
| Not clustering when treatment varies at group level | SEs dramatically understated | Always cluster at level of treatment assignment |
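The first and third rows can be illustrated by computing the cluster-robust (CR0) sandwich by hand on simulated data with cluster-level treatment and a strong cluster effect (all parameters invented):

```python
import numpy as np

rng = np.random.default_rng(3)
G, m = 50, 40                        # 50 clusters (e.g., states) of 40 units each
n = G * m
cluster = np.repeat(np.arange(G), m)

d = np.repeat(rng.binomial(1, 0.5, G), m).astype(float)  # treatment assigned at cluster level
alpha = np.repeat(rng.normal(size=G), m)                 # cluster random effect
y = 1.0 + 0.5 * d + alpha + rng.normal(size=n)

X = np.column_stack([np.ones(n), d])
XtX_inv = np.linalg.inv(X.T @ X)
beta = XtX_inv @ (X.T @ y)
u = y - X @ beta

# Classical (iid) SE for the treatment coefficient
iid_se = np.sqrt((u @ u / (n - 2)) * XtX_inv[1, 1])

# Cluster-robust (CR0) sandwich: meat = sum over clusters of X_g' u_g u_g' X_g
meat = np.zeros((2, 2))
for g in range(G):
    sg = X[cluster == g].T @ u[cluster == g]
    meat += np.outer(sg, sg)
cr_se = np.sqrt((XtX_inv @ meat @ XtX_inv)[1, 1])
print(f"iid SE: {iid_se:.3f}  cluster-robust SE: {cr_se:.3f}")
```

With within-cluster correlation this high, the iid SE is several times too small, which is exactly the over-rejection problem in the table. (With few clusters, even CR0 is unreliable; use the wild cluster bootstrap.)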
| Dimension | Design-Based | Model-Based |
|---|---|---|
| Source of randomness | Treatment assignment mechanism | Outcome draws from a superpopulation |
| Key assumption | Known or modeled treatment assignment | Correct outcome model specification |
| Examples | Experiments, RCTs, RDD, DiD, natural experiments | Structural models, matching, cross-sectional surveys |
| Advantages | Transparent; does not require outcome model | More powerful; extends to complex settings |
Design-based inference is appropriate when the assignment mechanism is known (experiments, lotteries, cutoffs); model-based inference is appropriate when random sampling from a well-defined population is reasonable. The standard in applied micro is a hybrid: design-based identification combined with model-based inference. Doubly robust methods (AIPW) combine both.
The key quantity is the Minimum Detectable Effect (MDE) — the smallest effect detectable with 80% power at alpha = 0.05.
Quick MDE formula (equal groups, two-sided test):
MDE = 2.8 × sigma / sqrt(N)
Required N = (2.8 × sigma / MDE)²
For IV designs, the effective MDE is inflated by the inverse of the first-stage coefficient: MDE_IV ≈ MDE_OLS / |pi|. A weak first stage (small pi) dramatically reduces power.
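The quick formulas above, plus the IV inflation, as small helpers (a sketch; the 2.8 multiplier bakes in 80% power and alpha = 0.05, two-sided, with equal-sized groups):

```python
from math import sqrt

def mde(sigma, n, multiplier=2.8):
    """Minimum detectable effect: MDE = 2.8 * sigma / sqrt(N)."""
    return multiplier * sigma / sqrt(n)

def required_n(sigma, target_mde, multiplier=2.8):
    """Invert the MDE formula: N = (2.8 * sigma / MDE)^2."""
    return (multiplier * sigma / target_mde) ** 2

def mde_iv(sigma, n, first_stage_pi, multiplier=2.8):
    """IV: effective MDE inflated by the inverse first stage, MDE / |pi|."""
    return mde(sigma, n, multiplier) / abs(first_stage_pi)

print(mde(1.0, 784))            # about 0.1 standard deviations
print(required_n(1.0, 0.1))     # about 784 observations
print(mde_iv(1.0, 784, 0.25))   # a first stage of 0.25 quadruples the MDE
```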
For DiD designs, effective power increases with more post-treatment periods and higher within-group correlation (absorbed by FEs). For RDD, use effective N (observations within bandwidth), not total N.
For cluster-randomized designs, the design effect (1 + (m-1) × ICC) inflates variance — with ICC = 0.05 and cluster size m = 50, you need 3.45x as many observations.
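The design-effect arithmetic from the text, checked directly:

```python
def design_effect(icc, m):
    """Kish design effect for cluster-randomized designs: 1 + (m - 1) * ICC."""
    return 1 + (m - 1) * icc

# The example above: ICC = 0.05, cluster size m = 50
deff = design_effect(0.05, 50)
print(f"design effect: {deff:.2f}")  # variance, and hence required N, inflated 3.45x
```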
For full MDE formulas (DiD, IV, RDD, cluster-randomized), power simulation code, and MDE interpretation tables, see references/reporting-standards.md.
A "bad control" is a variable that is itself an outcome of treatment. Conditioning on it introduces selection bias.
| Variable Type | Example | Why It Is Bad |
|---|---|---|
| Post-treatment outcome | Controlling for occupation when estimating returns to education | Education affects occupation; conditioning selects on an outcome of treatment |
| Mediator | Controlling for wages when estimating effect of training on employment | Blocks part of the causal effect |
| Collider | Conditioning on "survived" when estimating health effects | Opens a non-causal path |
Rule of thumb: If you cannot be sure a variable is determined before treatment, do not include it as a control. When in doubt, draw the DAG.
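The collider row is worth simulating once (invented numbers): treatment is randomized, yet conditioning on survival, which depends on both treatment and unobserved health, biases the estimate.

```python
import numpy as np

rng = np.random.default_rng(4)
n, tau = 200_000, 1.0                        # tau is the true treatment effect

d = rng.binomial(1, 0.5, n).astype(float)    # randomized treatment
u = rng.normal(size=n)                       # unobserved health
y = tau * d + u + rng.normal(size=n)         # outcome
s = (d + u > 0.5)                            # collider: survival depends on d and u

def slope(dd, yy):
    # OLS slope of yy on dd (difference in means for binary dd)
    C = np.cov(dd, yy)
    return C[0, 1] / C[0, 0]

full = slope(d, y)            # unbiased: d is randomized
survivors = slope(d[s], y[s]) # conditioning on the collider opens a non-causal path
print(f"full sample: {full:.2f}  survivors only: {survivors:.2f}  truth: {tau}")
```

Among survivors, untreated units must have unusually good unobserved health to be in the sample, so the survivor-only estimate is badly attenuated even though treatment was randomized.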
| Mistake | Consequence | Fix |
|---|---|---|
| Running TWFE with staggered timing | Already-treated units used as controls; negative weights; estimate can have wrong sign | Use Callaway-Sant'Anna, Sun-Abraham, or other modern DiD estimator |
| Using single post-treatment indicator for all cohorts | Masks heterogeneity in treatment effects across cohorts | Estimate group-time ATTs separately, then aggregate |
| Not reporting the Bacon decomposition | Reader cannot assess how much of the TWFE estimate comes from problematic comparisons | Report bacondecomp output |
Never plug a manual first-stage into an OLS second stage (SEs are wrong — use proper 2SLS). Never use a nonlinear first stage with linear second stage (not consistent — use control function). Never include generated regressors without bootstrapping the full two-step procedure.
For full minimum reporting standards (method-specific checklists for IV, DiD, RDD, SC, Matching) and complete power analysis code, see references/reporting-standards.md. For sensitivity analysis procedures (Oster bounds, Conley bounds, breakdown frontiers, specification curves), see references/sensitivity-analysis.md.
Agents:
- econometric-reviewer: Reviews identification strategy, standard errors, and diagnostic results
- identification-critic: Evaluates identification argument completeness and exclusion restrictions
- numerical-auditor: Designs power simulations for nonstandard study designs
- journal-referee: Reviews whether the empirical strategy meets journal standards

Cross-references:
- identification-proofs skill: Formalize an identification argument for the chosen method
- references/diagnostic-battery.md: Run the full diagnostic battery for the estimated specification
- references/sensitivity-analysis.md: Run sensitivity analysis (Oster bounds, specification curve, breakdown frontier)
- publication-output skill: Format regression tables and diagnostic output for publication