Differential expression analysis using DESeq2 (R/Bioconductor) via subprocess. Input: --counts <counts.csv> --metadata <metadata.csv> --contrast <treatment vs control> --output <dir>. Output: deseq2_results.csv (log2FC, padj), MA_plot.png, volcano_plot.png.
Run DESeq2 on a raw count matrix to identify differentially expressed genes between conditions.
| Task | Input | Output |
|---|---|---|
| Two-group DE | counts.csv + metadata.csv | deseq2_results.csv, MA_plot.png |
| Multi-factor | counts.csv + metadata.csv (multiple cols) | deseq2_results.csv per contrast |
| Shrinkage | lfcShrink via apeglm | shrunken_results.csv |
python3 run_deseq2.py \
--counts counts_matrix.csv \
--metadata sample_metadata.csv \
--contrast "treatment:drug_vs_control" \
--output results/deseq2/