bio-sequence-similarity
π―Skillfrom gptomics/bioskills
Compares and calculates sequence similarity metrics between biological sequences (DNA, RNA, protein) using alignment algorithms like BLAST, Smith-Waterman, or Needleman-Wunsch to quantify genetic r...
Installation
npx skills add https://github.com/gptomics/bioskills --skill bio-sequence-similaritySkill Details
Overview
# bioSkills
A collection of skills that guide AI coding agents (Claude Code, Codex, Gemini) through common bioinformatics tasks.
Project Goal
This repository provides AI agents with expert knowledge for bioinformatics workflows. Each skill contains code patterns, best practices, and examples that help agents generate correct, idiomatic code for common tasks.
Target users range from undergrads learning computational biology to PhD researchers processing large-scale data. The skills cover the full spectrum from basic sequence manipulation to advanced analyses like single-cell RNA-seq and population genetics.
Requirements
Python
- Python 3.9+
- biopython, pysam, cyvcf2, pybedtools, pyBigWig, scikit-allel, anndata
```bash
pip install biopython pysam cyvcf2 pybedtools pyBigWig scikit-allel anndata mygene
```
R/Bioconductor
Required for differential expression, single-cell, pathway analysis, and methylation skills.
```r
if (!require('BiocManager', quietly = TRUE))
install.packages('BiocManager')
BiocManager::install(c('DESeq2', 'edgeR', 'Seurat', 'clusterProfiler', 'methylKit'))
```
CLI Tools
```bash
# macOS
brew install samtools bcftools blast minimap2 bedtools
# Ubuntu/Debian
sudo apt install samtools bcftools ncbi-blast+ minimap2 bedtools
# conda
conda install -c bioconda samtools bcftools blast minimap2 bedtools \
fastp kraken2 metaphlan sra-tools bwa-mem2 bowtie2 star hisat2 \
manta delly cnvkit macs3 tobias
```
Installation
Claude Code
```bash
git clone https://github.com/your-username/bioSkills.git
cd bioSkills
./install-claude.sh # Install globally
./install-claude.sh --project /path/to/project # Or install to specific project
./install-claude.sh --list # List available skills
./install-claude.sh --validate # Validate all skills
./install-claude.sh --update # Only update changed skills
./install-claude.sh --uninstall # Remove all bio-* skills
```
Codex CLI
```bash
./install-codex.sh # Install globally
./install-codex.sh --project /path/to/project # Or install to specific project
./install-codex.sh --validate # Validate all skills
./install-codex.sh --update # Only update changed skills
./install-codex.sh --uninstall # Remove all bio-* skills
```
Gemini CLI
```bash
./install-gemini.sh # Install globally
./install-gemini.sh --project /path/to/project # Or install to specific project
./install-gemini.sh --validate # Validate all skills
./install-gemini.sh --update # Only update changed skills
./install-gemini.sh --uninstall # Remove all bio-* skills
```
Codex and Gemini installers convert to the Agent Skills standard (examples/ -> scripts/, usage-guide.md -> references/).
Skill Categories
| Category | Skills | Primary Tools | Description |
|----------|--------|---------------|-------------|
| sequence-io | 9 | Bio.SeqIO | Read, write, convert FASTA/FASTQ/GenBank and 40+ formats |
| sequence-manipulation | 7 | Bio.Seq, Bio.SeqUtils | Transcription, translation, motif search, sequence properties |
| database-access | 10 | Bio.Entrez, BLAST+, SRA toolkit, UniProt API | NCBI/UniProt queries, SRA downloads, BLAST, homology searches |
| alignment-files | 9 | samtools, pysam | SAM/BAM/CRAM viewing, sorting, filtering, statistics, validation |
| variant-calling | 13 | bcftools, cyvcf2, Manta, Delly, VEP, SnpEff | VCF/BCF calling, SVs, filtering, annotation, clinical interpretation |
| alignment | 4 | Bio.Align, Bio.AlignIO | Pairwise and multiple sequence alignment, MSA statistics, alignment I/O |
| phylogenetics | 5 | Bio.Phylo, IQ-TREE2, RAxML-ng | Tree I/O, visualization, ML inference with model selection, ultrafast bootstrap |
| **differential-expr