Skip to content

스킬 검색.../

Agent Skill Search Engine

검색

검색
카테고리
직업

About

About
Privacy
Terms

© 2026 Skills Pool. All rights reserved.

Biology Biopython | Skills Pool

스킬 파일

Biology Biopython

Bioinformatics with Biopython for sequence manipulation, file parsing, BLAST, and phylogenetics. Use when working with DNA/RNA/protein sequences or biological databases.

aiming-lab11,332 스타2026. 3. 31.

직업
카테고리: 생물정보학

스킬 내용

Biopython Bioinformatics Best Practice

Sequence Manipulation

Create sequences: from Bio.Seq import Seq; seq = Seq("ATGCGA")
Complement: seq.complement(); Reverse complement: seq.reverse_complement()
Transcription: seq.transcribe() (DNA to RNA)
Translation: seq.translate() (DNA/RNA to protein)
GC content: from Bio.SeqUtils import gc_fraction; gc_fraction(seq)
Molecular weight: from Bio.SeqUtils import molecular_weight

File Parsing (SeqIO)

Read FASTA: for rec in SeqIO.parse("file.fasta", "fasta"): ...
Read GenBank: for rec in SeqIO.parse("file.gb", "genbank"): ...
Read single record: rec = SeqIO.read("file.fasta", "fasta")
Write sequences: SeqIO.write(records, "output.fasta", "fasta")
Convert formats:

관련 스킬

빠른 설치

Biology Biopython

npx skillvault add aiming-lab/aiming-lab-autoresearchclaw-claude-skills-biology-biopython-skill-md

Skill 다운로드 저장소 열기

작성자: aiming-lab
스타: 11,332
업데이트: 2026. 3. 31.
직업

이 페이지의 내용

01Biopython Bioinformatics Best Practice

SeqIO.convert("input.gb", "genbank", "output.fasta", "fasta")

Index large files: idx = SeqIO.index("large.fasta", "fasta") for random access

BLAST Operations

Online BLAST: from Bio.Blast import NCBIWWW; result = NCBIWWW.qblast("blastn", "nt", seq)
Parse results: from Bio.Blast import NCBIXML; records = NCBIXML.parse(result)
Local BLAST: run via subprocess, parse XML output with NCBIXML
Always set Entrez.email before any NCBI access
Filter results by e-value (typically < 1e-5) and coverage

NCBI Database Access (Entrez)

Always set email: Entrez.email = "[email protected]"
Search: handle = Entrez.esearch(db="pubmed", term="query")
Fetch records: handle = Entrez.efetch(db="nucleotide", id="ID", rettype="fasta")
Use API key for higher rate limits (10 req/s vs 3 req/s)
Respect NCBI rate limits; add delays between batch requests

Phylogenetics (Bio.Phylo)

Read trees: from Bio import Phylo; tree = Phylo.read("tree.nwk", "newick")
Draw trees: Phylo.draw(tree) or Phylo.draw_ascii(tree)
Supported formats: newick, nexus, phyloxml
Traverse clades: for clade in tree.find_clades(): ...
Calculate distances: tree.distance(clade1, clade2)

Structure Analysis (Bio.PDB)

Parse PDB: parser = PDBParser(); structure = parser.get_structure("id", "file.pdb")
Hierarchy: Structure > Model > Chain > Residue > Atom
Get atoms: iterate through structure.get_atoms()
Calculate distances: use atom coordinate vectors
For mmCIF files: use MMCIFParser() instead of PDBParser()

Common Pitfalls

Always handle SeqIO.parse as an iterator — it exhausts after one pass
Check sequence alphabet compatibility before operations
Large files: use SeqIO.index() not SeqIO.to_dict() to avoid memory issues
Set proper timeout for remote BLAST queries (can take minutes)
Validate parsed data — missing annotations are common in public databases

02

Sequence Manipulation

03File Parsing (SeqIO)

04BLAST Operations

05NCBI Database Access (Entrez)

06Phylogenetics (Bio.Phylo)

07Structure Analysis (Bio.PDB)

08Common Pitfalls

데이터 과학자

생물정보학

Nanoclaw Repl

操作并扩展NanoClaw v2，这是ECC基于claude -p构建的零依赖会话感知REPL。

생물정보학

Bioinformatics

Gateway to 400+ bioinformatics skills from bioSkills and ClawBio. Covers genomics, transcriptomics, single-cell, variant calling, pharmacogenomics, metagenomics, structural biology, and more. Fetches domain-specific reference material on demand.

NousResearch98.6k

생물정보학

Smart Explore

Token-optimized structural code search using tree-sitter AST parsing. Use instead of reading full files when you need to understand code structure, find functions, or explore a codebase efficiently.

thedotmack62.6k

생물정보학

Vector Database Engineer

Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar

생물정보학

Skin Health Analyzer

Analyze skin health data, identify skin problem patterns, assess skin health status. Supports correlation analysis with nutrition, chronic diseases, and medication data.

생물정보학

Scanpy

Scanpy is a scalable Python toolkit for analyzing single-cell RNA-seq data, built on AnnData. Apply this skill for complete single-cell workflows including quality control, normalization, dimensionality reduction, clustering, marker gene identification, visualization, and trajectory analysis.

데이터 과학자