Microbial diversity analysis: alpha/beta diversity metrics, OTU/ASV methods, taxonomy assignment, and community comparison. Use when analyzing 16S amplicon or microbiome data.
| Region | Primers | Length | Notes |
|---|---|---|---|
| V4 | 515F/806R | ~250 bp | Earth Microbiome Project standard |
| V3-V4 | 341F/785R | ~460 bp | Good taxonomic resolution |
| V1-V3 | 27F/534R | ~500 bp | Alternative target |
Other markers: 18S (eukaryotes), ITS (fungi), shotgun metagenomics (full functional + taxonomic profiling).
| Feature | OTUs (97%) | ASVs |
|---|---|---|
| Resolution | Species-level (approx) | Single-nucleotide |
| Reproducibility | Depends on algorithm |
| Consistent across studies |
| Tools | UCLUST, VSEARCH, CD-HIT | DADA2, Deblur, UNOISE3 |
ASVs are now recommended (Callahan et al., 2017). DADA2 learns error rates per nucleotide transition, tests whether each unique sequence could be explained as an error from a more abundant sequence.
| Method | Approach |
|---|---|
| Naive Bayes classifier | k-mer frequencies (QIIME2 classify-sklearn) |
| BLAST/VSEARCH | Sequence similarity top hit/consensus |
| Phylogenetic placement | pplacer, EPA-ng |
Reference databases: SILVA (~2M seqs, most comprehensive), Greengenes2 (genome phylogeny), RDP, GTDB (genome-based gold standard).
Measures diversity within a single sample:
Qualitative (presence/absence) vs quantitative (abundance-weighted).
Central data structure: rows = OTUs/ASVs, columns = samples, values = read counts. Simulate with Dirichlet-multinomial model per body site.