Name: pLannotate Plasmid Annotation
Author: jaechang-hits

pLannotate Plasmid Annotation

Automatically annotate plasmid sequences with functional features (promoters, terminators, resistance genes, origins of replication, tags, fluorescent proteins) using BLAST-based detection against curated databases (Addgene, fpbase, SnapGene). Accepts FASTA or raw sequence; outputs annotated GenBank files, interactive HTML maps, and CSV feature tables. Handles circular topology correctly. Use for verifying synthetic plasmid construction, preparing Addgene submissions, sharing plasmid maps, or batch-annotating a cloning library.

jaechang-hits · 119 stars · February 18, 2026

Occupation
Category: Bioinformatics

Overview

pLannotate annotates plasmid sequences by running BLAST searches against a curated library of over 5,000 features sourced from Addgene, NCBI, and fpbase. It identifies promoters, terminators, antibiotic resistance genes, origins of replication, tags, and fluorescent proteins, and it handles circular plasmid topology correctly, which avoids the split-feature artifacts that arise from naive linear alignment. Results are written in three forms: annotated GenBank files for downstream use in SnapGene, Benchling, or BioPython; interactive HTML plasmid maps for sharing and review; and CSV tables for programmatic filtering. Both a Python API and a command-line interface are provided, and a Streamlit web app is bundled for exploratory use.
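
The split-feature artifact mentioned above can be illustrated with a small sketch. It uses exact string matching as a stand-in for BLAST and a common trick for circular sequences: extend the sequence with its own first bases, search the extended string, and map coordinates back with a modulo. This is an illustration only and is not taken from pLannotate's source; `find_circular` and the toy sequences are hypothetical.

```python
def find_circular(plasmid: str, feature: str) -> list[tuple[int, int]]:
    """Find exact matches of `feature` on a circular `plasmid`.

    Returns 0-based, inclusive (start, end) pairs. When end < start,
    the match wraps around the origin.
    """
    plasmid, feature = plasmid.upper(), feature.upper()
    n, m = len(plasmid), len(feature)
    if m == 0 or m > n:
        return []
    # Append the first m-1 bases so an origin-spanning match becomes contiguous.
    extended = plasmid + plasmid[: m - 1]
    hits = []
    pos = extended.find(feature)
    while pos != -1:
        # Map the end coordinate back onto the original circle.
        hits.append((pos, (pos + m - 1) % n))
        pos = extended.find(feature, pos + 1)
    return hits


# Toy plasmid (made up): ATGCGT is split across the origin,
# with ATG at the end of the sequence and CGT at the start.
plasmid = "CGT" + "T" * 7 + "ATG"
feature = "ATGCGT"

linear_hit = plasmid.find(feature)               # naive linear search
circular_hits = find_circular(plasmid, feature)  # circular-aware search
print(linear_hit, circular_hits)
```

The linear search returns -1 because the feature is never contiguous in the stored string, while the circular search reports a single hit whose end coordinate is smaller than its start.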

When to Use

Annotating a plasmid sequence received from a collaborator or downloaded from Addgene with no accompanying map
Verifying that all expected elements (promoter, insert, resistance marker, origin) are present after assembly or mutagenesis
Preparing a GenBank submission or Addgene deposit that requires a complete feature table
Batch-annotating a library of synthetic constructs produced by combinatorial cloning
Generating a shareable interactive plasmid map (HTML) without requiring SnapGene or Benchling licenses
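
For the verification use case, a check along the following lines could be run on a feature table once annotation is done. It is a minimal sketch that assumes the features are available as dictionaries with a `Feature` name column, as in the CSV output; the helper `missing_elements`, the example feature names, and the matching-by-substring rule are all illustrative assumptions.

```python
def missing_elements(features: list[dict], expected: dict[str, str]) -> list[str]:
    """Return labels of expected elements with no matching annotated feature.

    `expected` maps a human-readable label to a substring that should
    appear (case-insensitively) in at least one annotated feature name.
    """
    names = [row["Feature"].lower() for row in features]
    return [
        label
        for label, needle in expected.items()
        if not any(needle.lower() in name for name in names)
    ]


# Made-up annotation result for illustration only.
annotated = [
    {"Feature": "T7 promoter", "Feature_type": "promoter"},
    {"Feature": "AmpR", "Feature_type": "CDS"},
    {"Feature": "ori", "Feature_type": "rep_origin"},
]
expected = {
    "promoter": "T7 promoter",
    "resistance marker": "AmpR",
    "origin": "ori",
    "insert": "EGFP",
}

missing = missing_elements(annotated, expected)
print(missing)
```

Here the insert is reported as missing, which is the kind of result that would prompt a closer look at the assembly.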

Key Parameters

Parameter	Default	Range / Options	Effect
`linear`	`False`	`True`, `False`	Treat sequence as linear (`True`) or circular (`False`); circular mode handles split features at the origin correctly
`db`	`"addgene"`	`"addgene"`, `"fpbase"`, `"snapgene"`	Feature database to search; `addgene` is broadest (promoters, resistance genes, origins, tags); `fpbase` adds fluorescent protein variants; `snapgene` includes SnapGene-curated features
`min_len`	`0`	`0`–`500` bp	Minimum feature length in bp; increase to suppress short spurious matches
`blast_identity_threshold`	`95`	`70`–`100` %	Minimum BLAST % identity to report a hit; lower values detect diverged homologs but increase false positives
`--html` (CLI)	off	flag	Generate interactive HTML plasmid map alongside GenBank output
`--csv` (CLI)	off	flag	Write CSV feature table to the output directory
`--linear` (CLI)	off	flag	Treat input as linear sequence (default is circular)
`--input` / `-i` (CLI)	required	FASTA path	Input plasmid sequence in FASTA format
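
As a rough illustration of how the CLI flags in this table combine, the sketch below assembles an argument list for `subprocess.run`. The flags `--input`, `--html`, `--csv`, and `--linear` come from the table; the `batch` subcommand and the `--output` option are not listed there and are assumptions about the installed CLI, so check `plannotate batch --help` before relying on them. `build_command` is a hypothetical helper.

```python
import shlex


def build_command(fasta, out_dir, html=False, csv=False, linear=False):
    """Assemble a pLannotate CLI call as an argument list (nothing is executed)."""
    # Assumption: `batch` subcommand and `--output` exist in the installed version.
    cmd = ["plannotate", "batch", "--input", str(fasta), "--output", str(out_dir)]
    if html:
        cmd.append("--html")    # also write the interactive HTML map
    if csv:
        cmd.append("--csv")     # also write the CSV feature table
    if linear:
        cmd.append("--linear")  # treat the input as linear instead of circular
    return cmd


cmd = build_command("pUC19.fa", "annotations", html=True, csv=True)
print(shlex.join(cmd))

# To actually run it (requires pLannotate and BLAST+ to be installed):
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting problems with file names that contain spaces.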

Expected Outputs

Output File	Format	Description
`plasmid_annotated.gb`	GenBank	Sequence with annotated features; importable into SnapGene, Benchling, Geneious, ApE, BioPython
`plasmid_map.html`	HTML	Self-contained interactive circular plasmid map (Bokeh); shareable without a server
`all_features.csv`	CSV	Tabular feature list with columns: Feature, Feature_type, start, end, strand, pct_identity, pct_query_cov, database
`high_confidence_features.csv`	CSV	Filtered subset with identity >= 95% and coverage >= 90%
`all_plasmids_features.csv`	CSV	Batch mode: aggregated features across all plasmids with a `plasmid` column
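
The relationship between `all_features.csv` and `high_confidence_features.csv` can be sketched with the standard library alone. The column names and the 95% / 90% cutoffs are taken from the table above; the rows in `SAMPLE` are invented for illustration and are not real pLannotate output, and `high_confidence` is a hypothetical helper.

```python
import csv
import io

# Invented rows using the column layout described for all_features.csv.
SAMPLE = """Feature,Feature_type,start,end,strand,pct_identity,pct_query_cov,database
AmpR,CDS,100,960,1,100.0,100.0,snapgene
ori,rep_origin,1200,1788,1,99.5,100.0,snapgene
lacZ-alpha,CDS,2000,2100,-1,88.0,40.0,snapgene
"""


def high_confidence(rows, min_identity=95.0, min_coverage=90.0):
    """Keep rows that meet both the identity and the coverage threshold."""
    return [
        row
        for row in rows
        if float(row["pct_identity"]) >= min_identity
        and float(row["pct_query_cov"]) >= min_coverage
    ]


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
kept = high_confidence(rows)
kept_names = [row["Feature"] for row in kept]
print(kept_names)
```

With a real file, `io.StringIO(SAMPLE)` would be replaced by `open("all_features.csv", newline="")`.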

Troubleshooting

Problem	Cause	Solution
`FileNotFoundError: blastn not found`	BLAST+ not on PATH	Install via conda: `conda install -c bioconda blast`; or via package manager: `brew install blast` (macOS) / `apt install ncbi-blast+` (Linux)
`No features detected`	Sequence is too short, wrong database, or non-standard bases	Verify sequence length >= 500 bp; try a different `db` (e.g., `"fpbase"` for fluorescent protein vectors); check for ambiguous bases with `validate_plasmid()`
Annotations wrap incorrectly at position 0	Sequence treated as linear when it is circular	Set `linear=False` (default); this enables circular BLAST to catch features that span the sequence origin
HTML map renders blank	`bokeh` version mismatch	Upgrade: `pip install --upgrade bokeh`; pLannotate requires Bokeh >=2.4
Low identity hits for known features	Feature sequence has been mutated or codon-optimized	Lower `blast_identity_threshold` to 85–90% and label the resulting hits as diverged homologs
`MemoryError` or very slow annotation	Sequence > 50 kb or BLAST database not indexed	Split large sequences into sub-regions; ensure the internal pLannotate database index exists (reinstall if needed)
GenBank file not parsed by SnapGene	Non-standard feature type labels	Open in Geneious or BioPython first; check for special characters in feature qualifiers
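
Several of the problems above can be caught before annotation starts. The sketch below checks that `blastn` is on the PATH and flags sequences that are shorter than 500 bp, longer than 50 kb, or contain non-ACGT characters. The `validate_plasmid()` helper named in the table is not defined in this section, so `check_sequence` is a hypothetical stand-in rather than a reconstruction of it.

```python
import shutil

MIN_LEN = 500      # below this, the table suggests features may not be detected
MAX_LEN = 50_000   # above this, the table warns of slow runs or MemoryError


def blast_available() -> bool:
    """Return True if a `blastn` executable is found on the PATH."""
    return shutil.which("blastn") is not None


def check_sequence(seq: str) -> list[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    seq = "".join(seq.split()).upper()  # drop whitespace and line breaks
    problems = []
    if len(seq) < MIN_LEN:
        problems.append(f"sequence is {len(seq)} bp, shorter than {MIN_LEN} bp")
    if len(seq) > MAX_LEN:
        problems.append(f"sequence is {len(seq)} bp, longer than {MAX_LEN} bp")
    unexpected = sorted(set(seq) - set("ACGT"))
    if unexpected:
        problems.append("non-ACGT characters: " + "".join(unexpected))
    return problems


ok_report = check_sequence("ACGT" * 200)  # 800 bp, unambiguous bases only
bad_report = check_sequence("ACGTN")      # too short and contains N
print(blast_available(), ok_report, bad_report)
```

Returning a list of messages instead of raising on the first problem lets a batch run report every issue for a plasmid in one pass.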

pLannotate Plasmid Annotation

Overview

When to Use

Prerequisites

Quick Start

Workflow

Step 1: Load Plasmid Sequence

Step 2: Run BLAST-Based Annotation

Step 3: Filter Features by Quality Thresholds

Step 4: Export Annotated GenBank File

Step 5: Generate Interactive HTML Visualization

Step 6: Parse GenBank Output with BioPython

Step 7: Batch Annotate Multiple Plasmids

Key Parameters

Common Recipes

Recipe: Launch Web App for Interactive Use

Recipe: CLI Batch Annotation

Recipe: Compare Annotations Before and After Mutagenesis

Recipe: Export Feature Table to Excel with Conditional Formatting

Expected Outputs

Troubleshooting

References
