Linear Regression

Simple Linear Regression

Models the relationship between a continuous outcome (Y) and a single predictor (X):

Y = beta_0 + beta_1 * X + epsilon

beta_0: intercept (expected Y when X = 0)
beta_1: slope (expected change in Y per one-unit increase in X)
epsilon: random error (assumed normally distributed with mean zero and constant variance)
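
As a rough illustration of how beta_0 and beta_1 can be estimated, the sketch below uses the closed-form ordinary least squares (OLS) formulas. OLS is an assumption here, since the section does not name an estimation method. The function name fit_simple_ols and the data values are hypothetical.

```python
def fit_simple_ols(x, y):
    """Return (beta_0, beta_1) that minimize the sum of squared residuals."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sum of cross-products and sum of squares around the means
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    beta_1 = s_xy / s_xx              # slope: change in Y per unit of X
    beta_0 = y_bar - beta_1 * x_bar   # intercept: fitted Y at X = 0
    return beta_0, beta_1


# Hypothetical data, made up for illustration only
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1.1, 2.9, 5.2, 6.8, 9.0]

beta_0, beta_1 = fit_simple_ols(x, y)
print(f"intercept = {beta_0:.2f}, slope = {beta_1:.2f}")
```

The slope is the ratio of the X-Y cross-product sum to the X sum of squares, and the intercept is chosen so that the fitted line passes through the point of means.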

Multiple Linear Regression

Y = beta_0 + beta_1 * X_1 + beta_2 * X_2 + ... + beta_k * X_k + epsilon

Each coefficient represents the expected change in Y per one-unit increase in that predictor, holding all other predictors constant (adjusted effect).
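
To make the "adjusted effect" idea concrete, the following sketch fits a two-predictor model by solving the normal equations with plain Gaussian elimination, then compares the adjusted coefficient of X_1 with the unadjusted slope from regressing Y on X_1 alone. The estimation method (OLS), the helper names, and the noise-free data generated from Y = 1 + 2 * X_1 + 3 * X_2 are all assumptions chosen for illustration.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]  # augmented matrix copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_multiple_ols(rows, y):
    """Return [beta_0, beta_1, ..., beta_k] from the normal equations."""
    design = [[1.0] + [float(v) for v in r] for r in rows]  # intercept column
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Hypothetical, noise-free data where X_1 and X_2 are correlated
rows = [(0, 1), (1, 0), (2, 2), (3, 1), (4, 3), (5, 2)]
y = [1 + 2 * x1 + 3 * x2 for x1, x2 in rows]

coefs = fit_multiple_ols(rows, y)

# Unadjusted slope of Y on X_1 alone, for comparison
x1 = [r[0] for r in rows]
x1_bar = sum(x1) / len(x1)
y_bar = sum(y) / len(y)
unadjusted = (sum((a - x1_bar) * (b - y_bar) for a, b in zip(x1, y))
              / sum((a - x1_bar) ** 2 for a in x1))

print("adjusted coefficients:", [round(c, 6) for c in coefs])
print("unadjusted slope for X_1:", round(unadjusted, 4))
```

Because X_1 and X_2 rise together in this made-up data, the unadjusted slope absorbs part of the X_2 effect and is larger than the adjusted coefficient of X_1.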

Assumptions (LINE)

Linearity: the relationship between each predictor and the outcome is linear. Check: residuals-vs-fitted plot (should show no systematic pattern), component-plus-residual plots.
Independence: observations are independent of one another. Violated by: clustering, repeated measures, time series. Solutions: mixed-effects models, generalized estimating equations (GEE).
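
The residuals-vs-fitted check for linearity can be approximated numerically. The sketch below fits a straight line to deliberately curved data and averages the residuals within the lowest, middle, and highest thirds of the fitted values. This is a crude stand-in for inspecting the plot by eye, and the function names and data are hypothetical.

```python
def fit_line(x, y):
    """Closed-form OLS fit; returns (intercept, slope)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
             / sum((xi - x_bar) ** 2 for xi in x))
    return y_bar - slope * x_bar, slope


def mean_residual_by_fitted_third(x, y):
    """Mean residual in the low, middle, and high thirds of fitted values."""
    b0, b1 = fit_line(x, y)
    fitted = [b0 + b1 * xi for xi in x]
    resid = [yi - fi for yi, fi in zip(y, fitted)]
    pairs = sorted(zip(fitted, resid))  # order residuals by fitted value
    n = len(pairs)
    cuts = [0, n // 3, 2 * n // 3, n]
    return [sum(r for _, r in pairs[a:b]) / (b - a)
            for a, b in zip(cuts, cuts[1:])]


# Hypothetical data: the true relationship is quadratic, not linear
x = list(range(9))
y_curved = [xi ** 2 for xi in x]
low, mid, high = mean_residual_by_fitted_third(x, y_curved)
print("curved data:", round(low, 3), round(mid, 3), round(high, 3))

# Hypothetical data that is exactly linear, for contrast
y_linear = [2 + 3 * xi for xi in x]
flat = mean_residual_by_fitted_third(x, y_linear)
print("linear data:", [round(v, 6) for v in flat])
```

For the curved data the group means are positive, negative, then positive, which is the U-shaped pattern a residuals-vs-fitted plot would show when linearity fails. For the linear data they are all close to zero.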

Regression Analysis

Linear Regression

Simple Linear Regression

Multiple Linear Regression

Assumptions (LINE)

Diagnostics

Logistic Regression

Binary Logistic Regression

Odds Ratio Interpretation

OR vs RR

ROC Curve and Model Discrimination

Model Fit

Multinomial Logistic Regression

Ordinal Logistic Regression

Proportional Odds Model

Interpretation

Poisson Regression

Overdispersion

Negative Binomial Regression

Zero-Inflated Models

Model Building and Selection

Variable Selection Approaches

AIC and BIC

Multicollinearity

Definition

Detection

Consequences

Solutions

Interaction and Effect Modification

Interaction

Additive vs Multiplicative Interaction

Reporting Standards
