Ultra-fast RNA-seq transcript and gene-level quantification using quasi-mapping (no BAM required). Builds a k-mer index from a transcriptome FASTA, then quantifies reads in minutes. Outputs transcript-level TPM/count tables (quant.sf) with optional GC-bias and sequence-bias correction. Integrates directly with tximeta/tximport for DESeq2 or edgeR. Use STAR instead when a genome-aligned BAM is required for variant calling or visualization.
Salmon quantifies transcript abundance from RNA-seq reads using quasi-mapping — matching reads to a k-mer index of the transcriptome without full genome alignment. This makes Salmon 20–50× faster than alignment-based tools while producing accurate TPM and estimated count values. Salmon corrects for sequence-specific bias (--seqBias), GC-content bias (--gcBias), and fragment length distribution automatically. Output quant.sf files integrate directly with tximeta (R) or pydeseq2 (Python) for differential expression analysis. For improved accuracy, decoy-aware indexing uses the full genome to identify spurious quasi-mappings.
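The TPM values Salmon reports follow the standard definition: estimated reads per base of effective length, rescaled so all transcripts sum to one million. A minimal sketch of that arithmetic (illustrative only; Salmon's actual estimates also resolve multi-mapping reads and apply bias models):

```python
def tpm_from_counts(num_reads, effective_lengths):
    """Compute TPM from estimated counts and effective transcript lengths."""
    # Rate = reads per base of effective length
    rates = [n / l for n, l in zip(num_reads, effective_lengths)]
    total = sum(rates)
    # Rescale so TPM values sum to 1e6 across all transcripts
    return [r / total * 1e6 for r in rates]

tpm = tpm_from_counts([100, 300, 100], [1000.0, 1500.0, 500.0])
print([round(t) for t in tpm])  # → [200000, 400000, 400000]
```

Because of the per-length normalization, a short transcript with the same read count as a long one receives a higher TPM.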
Key flags: `--gcBias`, `--seqBias`, `--numBootstraps`. Python dependencies: pandas for parsing output; pydeseq2 for differential expression.

Check before installing: the tool may already be available in the current environment (e.g., inside a pixi/conda env). Run `command -v salmon` first and skip the install commands below if it returns a path. When running inside a pixi project, invoke the tool via `pixi run salmon` rather than bare `salmon`.
# Install with conda (recommended)
conda install -c bioconda salmon
# Verify
salmon --version
# salmon 1.10.3
# Or download pre-compiled binary
wget https://github.com/COMBINE-lab/salmon/releases/download/v1.10.0/salmon-1.10.0_linux_x86_64.tar.gz
tar xzvf salmon-1.10.0_linux_x86_64.tar.gz
export PATH="$PWD/salmon-latest_linux_x86_64/bin:$PATH"
# 1. Build transcriptome index (~5 min)
salmon index -t transcriptome.fa -i salmon_index/ -p 8
# 2. Quantify paired-end reads (~2-5 min per sample)
salmon quant \
-i salmon_index/ \
-l A \
-1 sample_R1.fastq.gz \
-2 sample_R2.fastq.gz \
-p 8 \
--gcBias --validateMappings \
-o results/sample1/
# Output: results/sample1/quant.sf
head results/sample1/quant.sf
Fetch a transcript FASTA from GENCODE or Ensembl (cDNA sequences only — not genome).
# Human transcriptome from GENCODE (recommended)
wget https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_47/gencode.v47.transcripts.fa.gz
gunzip gencode.v47.transcripts.fa.gz
# Count transcripts
grep -c "^>" gencode.v47.transcripts.fa
# ~252,000 transcripts
echo "Reference ready."
ls -lh gencode.v47.transcripts.fa
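GENCODE FASTA headers pack transcript ID, gene ID, HAVANA IDs, names, length, and biotype into one pipe-delimited string; the gene-level aggregation later in this guide relies on that layout. A minimal parsing sketch:

```python
def parse_gencode_header(header):
    """Split a GENCODE transcript FASTA header into its first two pipe-delimited fields."""
    fields = header.lstrip(">").split("|")
    return {"transcript_id": fields[0], "gene_id": fields[1]}

h = (">ENST00000456328.2|ENSG00000223972.6|OTTHUMG00000000961.2|"
     "OTTHUMT00000362751.1|DDX11L1-202|DDX11L1|1657|processed_transcript|")
rec = parse_gencode_header(h)
print(rec["gene_id"])  # → ENSG00000223972.6
```

Ensembl cDNA FASTA headers use a different, space-delimited layout, so the parsing would need adjusting for that source.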
Index the transcriptome for quasi-mapping. Add genome decoys for improved accuracy.
# Standard index (fast, sufficient for most analyses)
salmon index \
-t gencode.v47.transcripts.fa \
-i salmon_index/ \
-p 8
echo "Standard index complete."
# Decoy-aware index (recommended for accuracy — uses full genome as decoy)
# Step 1: create decoy list from genome chromosome names
grep "^>" GRCh38.primary_assembly.genome.fa | cut -d " " -f 1 | sed 's/>//' > decoys.txt
# Step 2: concatenate transcriptome + genome
cat gencode.v47.transcripts.fa GRCh38.primary_assembly.genome.fa > gentrome.fa
# Step 3: build decoy-aware index
salmon index \
-t gentrome.fa \
-d decoys.txt \
-i salmon_decoy_index/ \
-p 8
echo "Decoy-aware index complete."
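Salmon requires the decoy sequences to come after all transcript sequences in gentrome.fa, which the `cat` order above guarantees. The invariant can be expressed as a small check over record names (shown here with toy names rather than real files):

```python
def decoys_are_at_tail(record_names, decoy_names):
    """Return True if every decoy record appears after every non-decoy record."""
    decoys = set(decoy_names)
    seen_decoy = False
    for name in record_names:
        if name in decoys:
            seen_decoy = True
        elif seen_decoy:
            return False  # a transcript follows a decoy: ordering is broken
    return True

# Transcripts first, then genome chromosomes as decoys — valid
print(decoys_are_at_tail(["ENST1", "ENST2", "chr1", "chr2"], ["chr1", "chr2"]))  # → True
# A decoy interleaved among transcripts — invalid
print(decoys_are_at_tail(["ENST1", "chr1", "ENST2"], ["chr1"]))  # → False
```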
Run Salmon on single-end FASTQ files.
# Single-end quantification
salmon quant \
-i salmon_index/ \
-l A \
-r sample1.fastq.gz \
-p 8 \
--seqBias \
--validateMappings \
-o results/sample1/
echo "Mapping rate: $(grep 'Mapping rate' results/sample1/logs/salmon_quant.log | tail -1)"
echo "Output: results/sample1/quant.sf"
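Besides the log file, Salmon records run statistics in `aux_info/meta_info.json`, including a `percent_mapped` field. A sketch reading it programmatically, demonstrated against a mock output directory rather than a real run:

```python
import json
from pathlib import Path

def mapping_rate(quant_dir):
    """Read the mapping rate Salmon records in aux_info/meta_info.json."""
    meta = json.loads((Path(quant_dir) / "aux_info" / "meta_info.json").read_text())
    return meta["percent_mapped"]

# Build a mock quant directory so the function can be exercised without a real run
demo = Path("demo_quant/aux_info")
demo.mkdir(parents=True, exist_ok=True)
(demo / "meta_info.json").write_text(
    json.dumps({"percent_mapped": 93.4, "num_processed": 20_000_000})
)
print(f"Mapping rate: {mapping_rate('demo_quant'):.1f}%")  # → Mapping rate: 93.4%
```

Mapping rates well below ~70% for a matched transcriptome usually indicate contamination, adapter read-through, or the wrong reference.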
Run Salmon on paired-end FASTQ files with recommended bias correction flags.
# Paired-end with GC bias + sequence bias correction
salmon quant \
-i salmon_decoy_index/ \
-l A \
-1 sample1_R1.fastq.gz \
-2 sample1_R2.fastq.gz \
-p 8 \
--gcBias \
--seqBias \
--validateMappings \
--numBootstraps 100 \
-o results/sample1/
# quant.sf columns: Name, Length, EffectiveLength, TPM, NumReads
head results/sample1/quant.sf
Parse quant.sf to build a gene-level count matrix for differential expression.
import pandas as pd
from pathlib import Path
# Load single-sample output
quant = pd.read_csv("results/sample1/quant.sf", sep="\t")
print(f"Transcripts quantified: {len(quant)}")
print(f"Total estimated reads: {quant['NumReads'].sum():.0f}")
print(f"Transcripts with TPM > 1: {(quant['TPM'] > 1).sum()}")
print(quant.sort_values("TPM", ascending=False).head())
# Build a multi-sample TPM matrix
samples = ["ctrl_1", "ctrl_2", "treat_1", "treat_2"]
tpm_matrix = pd.DataFrame({
s: pd.read_csv(f"results/{s}/quant.sf", sep="\t").set_index("Name")["TPM"]
for s in samples
})
print(f"\nTPM matrix: {tpm_matrix.shape}")
tpm_matrix.to_csv("tpm_matrix.tsv", sep="\t")
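With the TPM matrix saved, a typical next step is dropping unexpressed transcripts and log-transforming before PCA or clustering. A sketch using a small synthetic matrix in place of the real one:

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the transcripts-by-samples TPM matrix built above
tpm = pd.DataFrame(
    {"ctrl_1": [0.0, 5.2, 120.0], "treat_1": [0.1, 4.8, 300.0]},
    index=["txA", "txB", "txC"],
)

# Keep transcripts expressed (TPM > 1) in at least one sample
expressed = tpm[(tpm > 1).any(axis=1)]

# log2(TPM + 1) compresses the dynamic range for exploratory plots
log_tpm = np.log2(expressed + 1)
print(expressed.shape)  # → (2, 2)
```

Note this filtering is for exploration only; DESeq2 should receive the unfiltered count matrix and perform its own independent filtering.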
Summarize transcript-level estimates to gene level and perform differential expression.
import pandas as pd
import re
from pathlib import Path
from pydeseq2.dds import DeseqDataSet
from pydeseq2.default_inference import DefaultInference
from pydeseq2.ds import DeseqStats
# Aggregate transcript counts to gene level using Ensembl gene IDs
# quant.sf Name format: "ENST00000456328.2|ENSG00000223972.6|..."
def extract_gene_id(transcript_id):
parts = transcript_id.split("|")
return parts[1].split(".")[0] if len(parts) > 1 else transcript_id
samples = ["ctrl_1", "ctrl_2", "treat_1", "treat_2"]
count_frames = []
for s in samples:
df = pd.read_csv(f"results/{s}/quant.sf", sep="\t")
df["gene_id"] = df["Name"].apply(extract_gene_id)
gene_counts = df.groupby("gene_id")["NumReads"].sum().round().astype(int)
count_frames.append(gene_counts.rename(s))
count_matrix = pd.DataFrame(count_frames).fillna(0).astype(int)
metadata = pd.DataFrame({
"condition": ["control", "control", "treated", "treated"]
}, index=samples)
# Run DESeq2
dds = DeseqDataSet(counts=count_matrix, metadata=metadata,
design_factors="condition",
inference=DefaultInference(n_cpus=4))
dds.deseq2()
stat_res = DeseqStats(dds, contrast=["condition", "treated", "control"],
inference=DefaultInference())
stat_res.summary()
results = stat_res.results_df
print(f"DE genes (padj < 0.05): {(results['padj'] < 0.05).sum()}")
print(results[results['padj'] < 0.05].sort_values('log2FoldChange').head())
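Once `results_df` is in hand, a common follow-up is filtering on both significance and effect size and saving the hits. A sketch against a mock results frame with the same columns pydeseq2 produces (`log2FoldChange`, `padj`):

```python
import pandas as pd

# Mock stand-in for stat_res.results_df
results = pd.DataFrame({
    "log2FoldChange": [2.1, -0.3, -1.8],
    "padj": [0.001, 0.6, 0.02],
}, index=["ENSG1", "ENSG2", "ENSG3"])

# Significant and biologically meaningful: padj < 0.05 and |LFC| > 1
sig = results[(results["padj"] < 0.05) & (results["log2FoldChange"].abs() > 1)]
sig.sort_values("padj").to_csv("de_genes.tsv", sep="\t")
print(len(sig))  # → 2
```

Filtering on fold change as well as padj avoids reporting statistically significant but tiny expression shifts.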
| Parameter | Default | Range/Options | Effect |
|---|---|---|---|
| -l / --libType | required | A (auto), U, SF, SR, IU, ISF, ISR | Library strandedness; A auto-detects from the first reads |
| -p / --threads | 1 | 1–64 | CPU threads; 8–16 is typical |
| --gcBias | off | flag | Correct for GC-content bias in fragment selection; recommended for most samples |
| --seqBias | off | flag | Correct for sequence-specific bias at read starts; recommended |
| --validateMappings | on since v1.0 | flag | Selective alignment for improved accuracy; the default since Salmon 1.0, so the flag is kept only for compatibility |
| --numBootstraps | 0 | 0–200 | Bootstrap replicates for uncertainty estimation; enables Sleuth/Swish |
| --dumpEq | off | flag | Write equivalence class counts alongside quant.sf |
| -d / --decoys | — | file | Decoy sequence list for decoy-aware indexing (salmon index) |
| --rangeFactorizationBins | 4 | 1–8 | Bins for the range-factorization model; improves accuracy at a small speed cost |
| --skipQuant | off | flag | Map reads but skip the quantification step itself |
#!/bin/bash
# Quantify all paired-end samples with recommended settings
INDEX="salmon_decoy_index"
DATA="data"
OUT="results"
THREADS=12
SAMPLES=(ctrl_1 ctrl_2 treat_1 treat_2)
mkdir -p "$OUT"
for sample in "${SAMPLES[@]}"; do
echo "Quantifying: $sample"
salmon quant \
-i "$INDEX" \
-l A \
-1 "$DATA/${sample}_R1.fastq.gz" \
-2 "$DATA/${sample}_R2.fastq.gz" \
-p "$THREADS" \
--gcBias --seqBias --validateMappings \
-o "$OUT/$sample/"
echo "Done: $sample — mapping $(grep 'Mapping rate' "$OUT/$sample/logs/salmon_quant.log" | tail -1)"
done
echo "All samples quantified."
# Snakefile — Salmon quantification rule