bio-protein-clustering-pangenome

Installation
SKILL.md

Bio Protein Clustering Pangenome

Cluster proteins into orthogroups and derive pangenome matrices.

Instructions

  1. Cluster proteins with MMseqs2 or ProteinOrtho.
  2. Build presence/absence matrix.
  3. Compute core/accessory/cloud/singleton partitions.
  4. Identify single-copy orthologs for phylogenetic analysis.
  5. Discriminate paralogs from orthologs in multi-copy gene families.
  6. Calculate pangenome statistics (completeness, orthogroup occupancy).

Quick Reference

Task Action
Run workflow Follow the steps in this skill and capture outputs.
Validate inputs Confirm required inputs and reference data exist.
Related skills
Installs
18
GitHub Stars
2
First Seen
Feb 19, 2026