Comprehensive standards for economics research covering methodology, econometric reporting, causal inference, data practices, and publication conventions across microeconomics, macroeconomics, econometrics, finance, development economics, behavioral economics, and health economics.
When to Use
When conducting empirical economics research (micro, macro, applied)
When specifying and estimating econometric models
When designing causal identification strategies
When preparing regression tables and results for publication
When writing or reviewing NBER working papers
When submitting to AEA journals (AER, AEJ, JEP, JEL)
When working with panel data, time series, or cross-sectional datasets
When pre-registering randomized controlled trials or field experiments
When preparing replication packages for journal submission
Protocol
1. Research Standards
관련 스킬
1.1 Citation and Style
Chicago Manual of Style (Author-Date) -- the dominant citation style in economics
In-text: (Acemoglu & Robinson, 2012)
Bibliography: Acemoglu, Daron, and James A. Robinson. 2012. Why Nations Fail. New York: Crown.
Journal-specific styles -- always check the target journal's style guide; AER, QJE, Econometrica, and JPE each have minor variations
BibTeX -- use @article, @book, @techreport (for working papers), and @incollection (for handbook chapters); maintain a clean .bib file
Working paper convention -- cite as "(Author, Year)" with the working paper number; update citations to published versions before final submission
1.2 NBER Working Paper Conventions
Include NBER working paper number prominently
Add the JEL classification codes (e.g., C23, D72, O15)
Include a structured abstract (250 words or fewer for most journals)
Acknowledge funding sources, data providers, and seminar participants
Standard disclaimer: "The views expressed herein are those of the authors and do not necessarily reflect the views of the National Bureau of Economic Research."
Cluster standard errors at the entity level (or the level of treatment variation)
Report within R-squared
Random Effects (RE) -- assumes unobserved effects are uncorrelated with regressors
More efficient than FE when the assumption holds
Hausman test -- compare FE and RE estimates; reject RE if they differ systematically
Correlated Random Effects (CRE / Mundlak) -- include group means of time-varying regressors to relax the RE assumption
Two-way fixed effects (TWFE) -- caution with staggered treatment timing; use modern DiD estimators (Callaway & Sant'Anna 2021; Sun & Abraham 2021; de Chaisemartin & d'Haultfoeuille 2020)
Dynamic panels -- Arellano-Bond / Blundell-Bond GMM for short T, large N panels with lagged dependent variables
2.3 Time Series Methods
Stationarity -- test with Augmented Dickey-Fuller (ADF), Phillips-Perron, or KPSS tests before modeling
Cointegration -- Engle-Granger two-step or Johansen test for long-run relationships among I(1) variables
VAR/VECM -- Vector Autoregression for reduced-form dynamics; Vector Error Correction Model when cointegration is present
Report impulse response functions (IRFs) with confidence bands
Granger causality tests for directional relationships
ARCH/GARCH -- for modeling volatility clustering in financial time series
Local projections (Jorda, 2005) -- flexible alternative to VAR-based IRFs; robust to misspecification
2.4 Structural vs. Reduced-Form
Reduced-form -- estimates causal effects without specifying a full economic model; transparent, easy to interpret, but limited in counterfactual analysis
Structural estimation -- specifies and estimates an economic model (utility functions, production functions, equilibrium conditions); allows counterfactual simulations but depends on model assumptions
Many papers now combine both: reduced-form evidence for causal effects + structural model for welfare/counterfactual analysis
When using structural models, clearly state all assumptions, the estimation method (MLE, GMM, simulated method of moments), and provide model fit diagnostics
2.5 Experimental and Quasi-Experimental Designs
Randomized Controlled Trials (RCTs)
Pre-register on the AEA RCT Registry
Report CONSORT-style flow diagrams
Analyze by intention-to-treat (ITT) as the primary specification
Report treatment-on-the-treated (TOT/LATE) using random assignment as an instrument for take-up
Account for multiple hypothesis testing (Bonferroni, Benjamini-Hochberg, or Westfall-Young)
Regression Discontinuity Design (RDD)
Plot the raw data around the discontinuity
Use local polynomial regression (triangular kernel preferred)
Report estimates across multiple bandwidths (optimal bandwidth via Calonico, Cattaneo, & Titiunik 2014)
Run McCrary (2008) density test to check for manipulation
Test for covariate smoothness at the cutoff
Difference-in-Differences (DiD)
Plot pre-treatment trends for treatment and control groups
Include event study specification with leads and lags
Test for pre-trends (joint significance of pre-treatment coefficients)
For staggered adoption: use Callaway-Sant'Anna, Sun-Abraham, or Borusyak-Jaravel-Spiess estimators
3. Econometric Reporting
3.1 Regression Tables
Economics has highly specific conventions for presenting regression results:
Standard errors in parentheses below the coefficient estimate -- never report t-statistics in parentheses (some journals accept brackets for t-statistics; always clarify in a table note)
Significance stars -- use the standard notation:
* p < 0.10
** p < 0.05
*** p < 0.01
Include a note at the bottom of every table: "Standard errors in parentheses. * p < 0.10, ** p < 0.05, *** p < 0.01."
Multiple specifications -- present results across columns, progressively adding controls:
Column (1): Bivariate regression
Column (2): Add demographic controls
Column (3): Add fixed effects
Column (4): Preferred specification
Column (5): Robustness (alternative sample, specification, or estimator)
Bottom-panel statistics -- report at the bottom of each table:
Number of observations (N)
R-squared (or adjusted R-squared, or within R-squared for FE)
Fixed effects included (entity, time, entity x time)
Clustering level for standard errors
First-stage F-statistic (for IV regressions)
Dependent variable mean (useful for interpreting effect sizes)
Formatting
Use stargazer (R), esttab/estout (Stata), or fixest::etable (R) for consistent table generation
Align decimal points across columns
Use consistent number of decimal places (typically 3 for coefficients, 3 for standard errors)
Label variables with readable names, not variable codes
3.2 Example Table Format
Table 3: Effect of Minimum Wage on Employment
(1) (2) (3) (4)
OLS OLS FE IV
Log(minimum wage) -0.152** -0.134** -0.098* -0.215**
(0.061) (0.058) (0.052) (0.089)
Controls No Yes Yes Yes
State fixed effects No No Yes Yes
Year fixed effects No No Yes Yes
Observations 1,530 1,530 1,530 1,530
R-squared 0.043 0.127 0.891 0.887
First-stage F 23.4
Dependent variable mean 4.82 4.82 4.82 4.82
Notes: Standard errors clustered at the state level in parentheses.
* p < 0.10, ** p < 0.05, *** p < 0.01. The dependent variable is
log(employment). Column (4) instruments for log(minimum wage) using
the interaction of federal minimum wage changes with state-level
exposure.
3.3 Robust and Clustered Standard Errors
Always specify the type of standard errors -- never report uncorrected OLS standard errors without justification
Heteroskedasticity-robust (HC) -- use HC1 (Stata default) or HC2/HC3 for small samples
Cluster-robust standard errors -- cluster at the level of treatment variation:
State-level policy: cluster at state level
Individual-level treatment in a school: cluster at school level
Firm-level event: cluster at firm level
Two-way clustering -- when error correlation occurs along two dimensions (e.g., firm and year), use Cameron, Gelbach, & Miller (2011) two-way clustering
Few clusters problem -- if fewer than ~40 clusters, standard cluster-robust errors are unreliable:
Use wild cluster bootstrap (Cameron, Gelbach, & Miller 2008)
Or use the effective number of clusters correction (Imbens & Kolesar 2016)
Conley standard errors -- for spatial correlation in cross-sectional data
3.4 IV-Specific Reporting
First-stage regression -- always report; include the first-stage F-statistic
Staiger-Stock rule of thumb -- F > 10 for a single endogenous regressor; for multiple instruments, use the Stock-Yogo critical values or the effective F-statistic (Olea & Pflueger 2013)
Weak instrument diagnostics -- report Anderson-Rubin confidence intervals if the instrument is potentially weak
Overidentification test -- Hansen J test when there are more instruments than endogenous regressors
Reduced-form -- present the reduced-form (regress outcome on instrument) alongside 2SLS; if the reduced form is not significant, the IV result is suspect
Report both OLS and IV -- compare to assess the direction and magnitude of endogeneity bias
Leave-one-out analysis (drop one state/country/sector at a time)
Sensitivity to outliers (winsorize at 1%/99%, Cook's distance)
Bounding exercises (Oster 2019 for omitted variable bias; Conley, Hansen, & Rossi 2012 for IV)
5. Data Standards
5.1 Reproducibility Requirements
Replication package -- must include:
All code (numbered or clearly ordered scripts)
Raw data or clear instructions for obtaining restricted/proprietary data
A master script that runs everything from raw data to final tables and figures
A README following the AEA template (data sources, computational requirements, instructions)
Software documentation -- specify exact versions:
Stata: version number + all user-written packages with version (ssc install ...)
R: sessionInfo() or renv lockfile
Python: requirements.txt with pinned versions
Runtime estimates -- state how long the code takes to run and on what hardware
Random seed -- set and document seeds for any stochastic procedure (bootstrap, simulation, MCMC)
5.2 Code Standards
Stata .do files -- the traditional workhorse of empirical economics
Use version XX at the top of the master .do file
Use relative file paths from a project root
Comment extensively; use section headers
Log output: log using "output/main_results.log", replace
R scripts -- increasingly common, especially for visualization and newer econometric methods
Use R projects (.Rproj) for path management
Prefer fixest for fixed effects estimation (fast, flexible)
Use modelsummary or stargazer for table generation
Python -- growing presence for machine learning, NLP in economics, and large-scale data processing
Use linearmodels for panel data, statsmodels for core econometrics
Jupyter notebooks for exploration; .py scripts for production code
File organization:
project/
├── data/
│ ├── raw/ # Never modify raw data
│ ├── processed/ # Cleaned and merged datasets
│ └── README.md # Data dictionary and sources
├── code/
│ ├── 01_clean.do
│ ├── 02_merge.do
│ ├── 03_analysis.do
│ ├── 04_tables.do
│ └── 05_figures.do
├── output/
│ ├── tables/
│ └── figures/
├── paper/
│ └── manuscript.tex
└── README.md
5.3 FAIR Data Principles
Findable -- deposit data in a searchable repository with a DOI (openICPSR, Zenodo, Dataverse)
Accessible -- use open formats (CSV, Parquet) over proprietary ones (.dta is acceptable in economics but include CSV versions)
Interoperable -- use standard variable naming conventions; include a codebook
Reusable -- attach a clear license; document provenance and transformations
5.4 Common Datasets in Economics
Dataset
Coverage
Access
Common Uses
PSID (Panel Study of Income Dynamics)
US households, 1968-present
Restricted (free registration)
Income dynamics, intergenerational mobility
CPS (Current Population Survey)
US labor force, monthly
Public (IPUMS-CPS)
Labor economics, wage analysis
ACS (American Community Survey)
US demographics, annual
Public (IPUMS-USA)
Regional analysis, immigration, housing
World Bank WDI
Country-level development indicators
Public
Development economics, cross-country analysis
Penn World Table
Cross-country GDP, productivity
Public
Growth economics, international comparisons
Compustat / CRSP
US firm financials / stock returns
Licensed (WRDS)
Corporate finance, asset pricing
NLSY (National Longitudinal Survey of Youth)
US youth cohorts (1979, 1997)
Public/restricted
Returns to education, labor market outcomes
DHS (Demographic and Health Surveys)
Developing countries, health + demographics
Public (registration)
Health economics, development
LISS / Understanding Society
Dutch / UK household panels
Public (registration)
Behavioral economics, labor, health
World Values Survey
Cross-country attitudes and values
Public
Institutional economics, culture
6. Additional Conventions
6.1 Figures
Publication quality -- use vector formats (PDF, EPS) for journal submission; minimum 300 DPI for rasters
Binned scatter plots -- the workhorse visualization in applied micro; use binscatter (Stata) or binsreg (Stata/R) for proper implementation with controls
Event study plots -- plot coefficients and 95% confidence intervals relative to treatment timing; normalize the period before treatment to zero
RDD plots -- show raw data with a fitted polynomial on each side of the cutoff; include the confidence interval
Coefficient plots -- preferred over tables for presenting many estimates; use coefplot (Stata) or ggplot2::geom_pointrange (R)
Color and accessibility -- use colorblind-friendly palettes; ensure figures are legible in grayscale
6.2 JEL Classification Codes
Include 2-3 JEL codes with every paper. Common codes:
C -- Mathematical and Quantitative Methods (C23: Panel Data Models; C26: IV)
D -- Microeconomics (D12: Consumer Economics; D72: Political Processes)
E -- Macroeconomics (E24: Employment; E52: Monetary Policy)
F -- International Economics (F13: Trade Policy; F31: Foreign Exchange)
G -- Financial Economics (G12: Asset Pricing; G21: Banks)
H -- Public Economics (H23: Externalities; H75: State and Local Government)
I -- Health, Education, Welfare (I12: Health Behavior; I26: Returns to Education)
L -- Industrial Organization (L11: Market Structure; L86: IT Services)
O -- Economic Development (O15: Human Resources; O33: Technological Change)
6.3 Authorship in Economics
Author order -- alphabetical ordering is the strong norm in economics (unlike most other social sciences)
Exceptions exist for very unequal contributions, but alphabetical is the default expectation
Acknowledgments -- thank seminar participants, conference discussants, referees, and editors by convention
Working paper circulation -- it is standard to circulate working papers (NBER, SSRN, CEPR) well before journal publication; most economics papers are cited in working paper form for years
Checklist
Research Design
Identification strategy clearly stated and defended
Key assumptions explicitly listed and discussed
Threats to identification addressed
Pre-registration completed (if experimental or quasi-experimental)
IRB approval obtained (if human subjects involved)
JEL codes assigned
Econometric Reporting
Standard errors in parentheses below coefficients
Type of standard errors specified (robust, clustered, bootstrap)
Clustering level matches level of treatment variation
All variables defined in text before use in equations
Funding and conflicts of interest disclosed
References
Textbooks and Methodological Guides
Angrist, Joshua D., and Jorn-Steffen Pischke. 2009. Mostly Harmless Econometrics: An Empiricist's Companion. Princeton: Princeton University Press.
Angrist, Joshua D., and Jorn-Steffen Pischke. 2014. Mastering 'Metrics: The Path from Cause to Effect. Princeton: Princeton University Press.
Wooldridge, Jeffrey M. 2010. Econometric Analysis of Cross Section and Panel Data. 2nd ed. Cambridge, MA: MIT Press.
Cameron, A. Colin, and Pravin K. Trivedi. 2005. Microeconometrics: Methods and Applications. New York: Cambridge University Press.
Greene, William H. 2018. Econometric Analysis. 8th ed. New York: Pearson.
Cunningham, Scott. 2021. Causal Inference: The Mixtape. New Haven: Yale University Press. (https://mixtape.scunning.com/)
Huntington-Klein, Nick. 2022. The Effect: An Introduction to Research Design and Causality. Boca Raton: Chapman & Hall/CRC. (https://theeffectbook.net/)
Stock, James H., and Mark W. Watson. 2020. Introduction to Econometrics. 4th ed. New York: Pearson.
Key Methodology Papers
Callaway, Brantly, and Pedro H. C. Sant'Anna. 2021. "Difference-in-Differences with Multiple Time Periods." Journal of Econometrics 225 (2): 200-230.
Sun, Liyang, and Sarah Abraham. 2021. "Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects." Journal of Econometrics 225 (2): 175-199.
Calonico, Sebastian, Matias D. Cattaneo, and Rocio Titiunik. 2014. "Robust Nonparametric Confidence Intervals for Regression-Discontinuity Designs." Econometrica 82 (6): 2295-2326.
Oster, Emily. 2019. "Unobservable Selection and Coefficient Stability: Theory and Evidence." Journal of Business & Economic Statistics 37 (2): 187-204.
Cameron, A. Colin, Jonah B. Gelbach, and Douglas L. Miller. 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors." Review of Economics and Statistics 90 (3): 414-427.
Conley, Timothy G., Christian B. Hansen, and Peter E. Rossi. 2012. "Plausibly Exogenous." Review of Economics and Statistics 94 (1): 260-272.
McCrary, Justin. 2008. "Manipulation of the Running Variable in the Regression Discontinuity Design: A Density Test." Journal of Econometrics 142 (2): 698-714.
Olea, Jose Luis Montiel, and Carolin Pflueger. 2013. "A Robust Test for Weak Instruments." Journal of Business & Economic Statistics 31 (3): 358-369.