M-estimation, influence functions, and semiparametric efficiency theory for causal inference
Rigorous framework for statistical inference and efficiency in modern methodology
Use this skill when working on: asymptotic properties of estimators, influence functions, semiparametric efficiency, double robustness, variance estimation, confidence intervals, hypothesis testing, M-estimation, or deriving limiting distributions.
Cramér-Rao Lower Bound: For any unbiased estimator, $$\text{Var}(\hat{\theta}) \geq \frac{1}{nI(\theta)}$$
where $I(\theta)$ is the Fisher information.
Semiparametric Efficiency Bound: The variance of the efficient influence function: $$V_{eff} = E[\phi^*(\theta_0)^2]$$
where $\phi^*$ is the efficient influence function (EIF).
Influence Function Notation: $IF(O; \theta, P)$ represents the influence of observation $O$ on parameter $\theta$ under distribution $P$: $$IF(O; \theta, P) = \lim_{\epsilon \to 0} \frac{T((1-\epsilon)P + \epsilon \delta_O) - T(P)}{\epsilon}$$
Semiparametric Variance: For RAL estimators, $$\sqrt{n}(\hat{\theta} - \theta_0) \xrightarrow{d} N(0, E[IF(O)^2])$$
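To make the Gateaux-derivative definition concrete, here is a minimal numerical sketch (names and seed are illustrative) for the mean functional $T(P) = E[X]$, whose influence function is $x - E[X]$:

```r
# Gateaux derivative of the mean functional T(P) = E[X] at the
# empirical distribution, contaminated with a point mass at x0
set.seed(1)
x <- rnorm(500)
x0 <- 2.5
eps <- 1e-6

T_Pn <- mean(x)
# The mean is linear in P: T((1 - eps) P_n + eps * delta_x0)
T_contam <- (1 - eps) * T_Pn + eps * x0
if_numeric <- (T_contam - T_Pn) / eps

# Closed form: IF(x0; mean, P_n) = x0 - mean(x)
if_closed <- x0 - T_Pn
```

The RAL variance formula then gives $\text{Var}(\bar{X}_n) \approx E[IF^2]/n = \text{Var}(X)/n$, recovering the textbook result.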
Estimating Equations: M-estimators solve $\sum_{i=1}^n \psi(O_i; \theta) = 0$, with asymptotic variance: $$V = \left(\frac{\partial}{\partial \theta} E[\psi(O; \theta)]\right)^{-1} E[\psi(O; \theta)\psi(O; \theta)^T] \left(\frac{\partial}{\partial \theta} E[\psi(O; \theta)]\right)^{-T}$$
| Estimand | Efficient Influence Function | Efficiency Bound |
|---|---|---|
| ATE | $\phi_{ATE} = \frac{A}{\pi}(Y-\mu_1) - \frac{1-A}{1-\pi}(Y-\mu_0) + \mu_1 - \mu_0 - \psi$ | $V_{ATE} = E[\phi_{ATE}^2]$ |
| NDE | No simple closed form; see VanderWeele & Tchetgen Tchetgen (2014) | Higher than $V_{ATE}$ |
| NIE | No simple closed form; see VanderWeele & Tchetgen Tchetgen (2014) | Higher than $V_{ATE}$ |
```r
# Compute the semiparametric efficiency bound via the estimated
# efficient influence function (AIPW form for the ATE)
compute_efficiency_bound <- function(data, estimand = "ATE") {
  if (estimand != "ATE") stop("Only estimand = 'ATE' is implemented.")
  n <- nrow(data)

  # Estimate nuisance functions: propensity score and outcome regressions
  ps_model <- glm(A ~ X, data = data, family = binomial)
  pi_hat <- predict(ps_model, type = "response")
  mu1_model <- lm(Y ~ X, data = subset(data, A == 1))
  mu0_model <- lm(Y ~ X, data = subset(data, A == 0))
  mu1_hat <- predict(mu1_model, newdata = data)
  mu0_hat <- predict(mu0_model, newdata = data)

  # Efficient influence function, centered at the plug-in estimate
  psi_hat <- mean(mu1_hat - mu0_hat)
  phi <- with(data, {
    A / pi_hat * (Y - mu1_hat) -
      (1 - A) / (1 - pi_hat) * (Y - mu0_hat) +
      mu1_hat - mu0_hat - psi_hat
  })

  # Efficiency bound = variance of the EIF
  list(
    efficiency_bound = var(phi),
    standard_error = sqrt(var(phi) / n),
    eif_values = phi
  )
}
```
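As a usage sketch, the same EIF computation can be run end to end on simulated data (the data-generating process and seed below are illustrative; the nuisance steps are rewritten inline so the snippet runs standalone):

```r
set.seed(42)
n <- 2000
X <- rnorm(n)
A <- rbinom(n, 1, plogis(0.5 * X))
Y <- 1 + 2 * A + X + rnorm(n)                    # true ATE = 2
dat <- data.frame(X = X, A = A, Y = Y)

# Nuisance estimates (same steps as compute_efficiency_bound)
pi_hat  <- fitted(glm(A ~ X, family = binomial, data = dat))
mu1_hat <- predict(lm(Y ~ X, data = dat, subset = A == 1), newdata = dat)
mu0_hat <- predict(lm(Y ~ X, data = dat, subset = A == 0), newdata = dat)

# Centered EIF and the one-step (AIPW) estimate
psi_plugin <- mean(mu1_hat - mu0_hat)
phi <- with(dat, A / pi_hat * (Y - mu1_hat) -
                 (1 - A) / (1 - pi_hat) * (Y - mu0_hat) +
                 mu1_hat - mu0_hat - psi_plugin)
psi_aipw <- psi_plugin + mean(phi)
se <- sd(phi) / sqrt(n)
c(estimate = psi_aipw, se = se)    # 95% CI: psi_aipw +/- 1.96 * se
```

The EIF-based standard error is exactly the plug-in for $\sqrt{V_{eff}/n}$ from the efficiency bound above.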
Empirical Process: $\mathbb{G}_n(f) = \sqrt{n}(\mathbb{P}_n - P)f = \frac{1}{\sqrt{n}}\sum_{i=1}^n (f(O_i) - Pf)$
Uniform Convergence: For a Donsker class $\mathcal{F}$, $$\sup_{f \in \mathcal{F}} |\mathbb{G}_n(f)| \xrightarrow{d} \sup_{f \in \mathcal{F}} |\mathbb{G}(f)|$$
where $\mathbb{G}$ is a Gaussian process.
| Measure | Definition | Use |
|---|---|---|
| VC dimension | Max shattered set size | Classification |
| Covering number | $N(\epsilon, \mathcal{F}, \lVert\cdot\rVert)$ | General classes |
| Bracketing number | $N_{[\,]}(\epsilon, \mathcal{F}, L_2)$ | Entropy bounds |
| Rademacher complexity | $\mathcal{R}_n(\mathcal{F}) = E[\sup_{f \in \mathcal{F}} \lvert \frac{1}{n}\sum_i \epsilon_i f(X_i) \rvert]$ | Generalization bounds |
```r
# Estimate Rademacher complexity via Monte Carlo
# f_class: a list of functions, each mapping the data to a numeric vector
estimate_rademacher <- function(f_class, data, n_reps = 1000) {
  n <- nrow(data)
  sup_values <- replicate(n_reps, {
    # Draw i.i.d. Rademacher signs
    epsilon <- sample(c(-1, 1), n, replace = TRUE)
    # Supremum of |n^-1 sum_i epsilon_i f(O_i)| over the function class
    max(sapply(f_class, function(f) abs(mean(epsilon * f(data)))))
  })
  mean(sup_values)
}
```
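A quick sanity check, written self-contained (it inlines the same Monte Carlo loop): for a finite class of functions bounded by 1, Massart's lemma bounds the Rademacher complexity by $\sqrt{2\log(2|\mathcal{F}|)/n}$ (the factor 2 accounts for the absolute value). The threshold class and seed are illustrative:

```r
set.seed(7)
n <- 200
x <- rnorm(n)
thresholds <- seq(-2, 2, by = 0.5)

# Monte Carlo Rademacher complexity of the finite class f_t(x) = 1{x <= t}
r_hat <- mean(replicate(500, {
  eps <- sample(c(-1, 1), n, replace = TRUE)
  max(sapply(thresholds, function(t) abs(mean(eps * (x <= t)))))
}))

# Massart finite-class bound (functions bounded by 1)
massart <- sqrt(2 * log(2 * length(thresholds)) / n)
c(estimate = r_hat, bound = massart)
```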
A function class $\mathcal{F}$ is Donsker if $\mathbb{G}_n \rightsquigarrow \mathbb{G}$ in $\ell^\infty(\mathcal{F})$, where $\mathbb{G}$ is a tight Gaussian process.
| Class | Description | Application |
|---|---|---|
| VC classes | Finite VC dimension | Classification functions |
| Smooth functions | Bounded derivatives | Regression estimators |
| Monotone functions | Uniformly bounded, monotone | Distribution functions |
| Lipschitz functions | Bounded Lipschitz constant | M-estimators |
For M-estimation: If $\psi(O, \theta)$ belongs to a Donsker class, then $$\sqrt{n}(\hat{\theta} - \theta_0) \xrightarrow{d} N(0, V)$$
where $V = (\partial_\theta E[\psi])^{-1} \text{Var}(\psi) (\partial_\theta E[\psi])^{-T}$
```r
# Heuristic numerical check of the Donsker entropy condition
#   int_0^1 sqrt(log N_[](eps, F, L_2)) d eps < Inf.
# `estimate_bracketing_number` must be supplied by the user, and a numerical
# integral over a finite grid is always finite, so treat this as a
# diagnostic of entropy growth rather than a proof of the Donsker property.
check_donsker_conditions <- function(psi_class, data) {
  epsilon_grid <- seq(0.01, 1, by = 0.01)
  bracket_numbers <- sapply(epsilon_grid, function(eps) {
    estimate_bracketing_number(psi_class, data, eps)  # N_[](eps, F, L_2)
  })
  # Interpolate the root log-entropy and integrate over the observed grid
  entropy_fn <- approxfun(epsilon_grid, sqrt(log(bracket_numbers)), rule = 2)
  entropy_integral <- integrate(entropy_fn,
                                lower = min(epsilon_grid), upper = 1)
  list(
    entropy_integral = entropy_integral$value,
    bracket_numbers = data.frame(epsilon = epsilon_grid, N = bracket_numbers)
  )
}
```
```
Estimator θ̂ₙ → Consistency → Asymptotic Normality → Efficiency → Inference
                    ↓                  ↓                  ↓            ↓
               θ̂ₙ →ᵖ θ₀    √n(θ̂ₙ−θ₀) →ᵈ N(0,V)      V = V_eff   CIs, tests
```
Convergence in probability: $X_n \xrightarrow{p} X$ if $\forall \epsilon > 0$: $P(|X_n - X| > \epsilon) \to 0$
Consistency: $\hat{\theta}_n \xrightarrow{p} \theta_0$
Convergence in distribution: $X_n \xrightarrow{d} X$ if $F_{X_n}(x) \to F_X(x)$ at all continuity points of $F_X$
Asymptotic normality: $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$
Almost sure convergence: $X_n \xrightarrow{a.s.} X$ if $P(\lim_{n\to\infty} X_n = X) = 1$
Relationship: $\xrightarrow{a.s.} \Rightarrow \xrightarrow{p} \Rightarrow \xrightarrow{d}$
| Notation | Meaning | Example |
|---|---|---|
| $O_p(1)$ | Bounded in probability | $\hat{\theta}_n = O_p(1)$ |
| $o_p(1)$ | Converges to 0 in probability | $\hat{\theta}_n - \theta_0 = o_p(1)$ |
| $O_p(a_n)$ | $X_n/a_n = O_p(1)$ | $\hat{\theta}_n - \theta_0 = O_p(n^{-1/2})$ |
| $o_p(a_n)$ | $X_n/a_n = o_p(1)$ | Remainder terms |
Weak LLN: If $X_1, \ldots, X_n$ iid with $E|X| < \infty$: $$\bar{X}_n \xrightarrow{p} E[X]$$
Strong LLN: If $X_1, \ldots, X_n$ iid with $E|X| < \infty$: $$\bar{X}_n \xrightarrow{a.s.} E[X]$$
Uniform LLN: For $\sup_{\theta \in \Theta}$ convergence, need additional conditions (compactness, envelope).
Classical CLT: If $X_1, \ldots, X_n$ iid with $E[X] = \mu$, $Var(X) = \sigma^2 < \infty$: $$\sqrt{n}(\bar{X}_n - \mu) \xrightarrow{d} N(0, \sigma^2)$$
Lindeberg-Feller CLT: For a triangular array $\{X_{ni}\}$ with $E[X_{ni}] = 0$ and $s_n^2 = \sum_{i=1}^n \text{Var}(X_{ni})$, if the Lindeberg condition holds, $$\frac{1}{s_n^2}\sum_{i=1}^n E[X_{ni}^2 \mathbf{1}(|X_{ni}| > \epsilon s_n)] \to 0 \quad \forall \epsilon > 0,$$ then $s_n^{-1}\sum_{i=1}^n X_{ni} \xrightarrow{d} N(0, 1)$.
Multivariate CLT: $$\sqrt{n}(\bar{X}_n - \mu) \xrightarrow{d} N(0, \Sigma)$$
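The classical CLT is easy to verify by simulation; a minimal sketch with skewed Exp(1) draws (so normality of the mean is not inherited from the draws themselves):

```r
set.seed(123)
n <- 500
reps <- 2000

# Standardized sample means of Exp(1) draws (mu = 1, sigma = 1)
z <- replicate(reps, sqrt(n) * (mean(rexp(n)) - 1))

c(mean = mean(z), sd = sd(z))   # approximately 0 and 1
```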
Slutsky's Theorem: If $X_n \xrightarrow{d} X$ and $Y_n \xrightarrow{p} c$ (constant), then $X_n + Y_n \xrightarrow{d} X + c$ and $Y_n X_n \xrightarrow{d} cX$.
Continuous Mapping Theorem: If $X_n \xrightarrow{d} X$ and $g$ continuous: $$g(X_n) \xrightarrow{d} g(X)$$
Delta Method: If $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$ and $g$ differentiable at $\theta_0$: $$\sqrt{n}(g(\hat{\theta}_n) - g(\theta_0)) \xrightarrow{d} N(0, g'(\theta_0)^\top V g'(\theta_0))$$
Multivariate: Replace $g'(\theta_0)$ with Jacobian matrix.
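A worked delta-method example (illustrative choice of $g$ and distribution): for $g(\mu) = \log(\mu)$ with $X \sim \text{Exp}(1)$, $g'(\mu) = 1/\mu$, so the delta-method SE of $\log(\bar{X}_n)$ is $s_n/(\sqrt{n}\,\bar{X}_n)$, which can be checked against a Monte Carlo estimate:

```r
set.seed(99)
n <- 400

# Delta method: SE(log(xbar)) ~ g'(mu) * SE(xbar) = sd(x) / (sqrt(n) * xbar)
x <- rexp(n)
delta_se <- sd(x) / (sqrt(n) * mean(x))

# Monte Carlo check: sd of log(sample mean) across fresh replications
mc_se <- sd(replicate(2000, log(mean(rexp(n)))))
c(delta = delta_se, monte_carlo = mc_se)
```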
Estimator $\hat{\theta}_n$ solves: $$\hat{\theta}_n = \arg\max_{\theta \in \Theta} M_n(\theta)$$
where $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$
Result: under identifiability of $\theta_0$ and uniform convergence $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$: $\hat{\theta}_n \xrightarrow{p} \theta_0$
Result: $$\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, [-\ddot{M}(\theta_0)]^{-1} V [-\ddot{M}(\theta_0)]^{-1})$$
Sandwich estimator: $$\hat{V} = \hat{A}^{-1} \hat{B} \hat{A}^{-T}, \qquad \hat{A} = \frac{1}{n}\sum_{i=1}^n \frac{\partial}{\partial \theta^T} \psi(O_i; \hat{\theta}), \qquad \hat{B} = \frac{1}{n}\sum_{i=1}^n \psi(O_i; \hat{\theta})\,\psi(O_i; \hat{\theta})^T$$
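A hand-rolled sandwich for logistic regression, where the score is $\psi_i = x_i(y_i - p_i)$; the simulated design is illustrative, and since the model is correctly specified the robust SEs should roughly agree with the model-based ones:

```r
set.seed(11)
n <- 1000
x <- rnorm(n)
y <- rbinom(n, 1, plogis(-0.5 + 1.2 * x))

fit <- glm(y ~ x, family = binomial)
X <- model.matrix(fit)
p_hat <- fitted(fit)

# Bread: A_hat = -n^-1 X' diag(p(1-p)) X  (derivative of the score)
A_hat <- -crossprod(X * (p_hat * (1 - p_hat)), X) / n
# Meat: B_hat = n^-1 sum psi_i psi_i' with psi_i = x_i (y_i - p_i)
psi <- X * (y - p_hat)
B_hat <- crossprod(psi) / n

V_sandwich <- solve(A_hat) %*% B_hat %*% solve(A_hat) / n
robust_se <- sqrt(diag(V_sandwich))
model_se  <- sqrt(diag(vcov(fit)))
rbind(robust = robust_se, model = model_se)
```

Under misspecification (e.g., clustered or heteroscedastic data) the two rows diverge, and the sandwich form remains valid.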