tooluniverse-cancer-genomics-tcga

Installation
SKILL.md

Cancer Genomics / TCGA Analysis

TCGA analysis starts with: what cancer type? what data type? Build your cohort FIRST (GDC filters), then analyze. Don't query mutations without defining the cohort — pan-cancer counts from GDC_get_mutation_frequency are uninformative without cancer-type context. A mutation frequency of 10% in one cancer type may be 0.5% in another; always specify project_id. Survival analysis (Kaplan-Meier) is hypothesis-generating in retrospective TCGA data — always report sample size and p-value, and note that TCGA cohorts are not treatment-stratified.

LOOK UP DON'T GUESS: never assume TCGA project IDs, NCIt codes, or gene coordinates — use GDC_list_projects to confirm project IDs and Progenetix_list_filtering_terms for NCIt codes.

Systematic TCGA/GDC analysis: define cohorts, retrieve clinical data, profile somatic mutations, query copy number variations, run survival analysis, and interpret variants with OncoKB.

When to Use

  • "What is the mutation frequency of TP53 in TCGA-BRCA?"
  • "Get survival data for TCGA-LUAD patients"
  • "Find clinical data for breast cancer cases in GDC"
  • "Which TCGA projects have KRAS G12C mutations?"
  • "Show CNV amplifications of EGFR in glioblastoma"
  • "Annotate BRAF V600E for clinical significance in melanoma"
Related skills

More from mims-harvard/tooluniverse

Installs
84
GitHub Stars
1.3K
First Seen
Mar 25, 2026