skills/smithery.ai/tcga-bulk-data-preprocessing-with-omicverse

tcga-bulk-data-preprocessing-with-omicverse

SKILL.md

TCGA Bulk Data Preprocessing with OmicVerse

Overview

Use this skill for loading TCGA data from GDC downloads, building normalised expression matrices, attaching clinical metadata, and running survival analyses through ov.bulk.pyTCGA.

Instructions

1. Gather required downloads

Confirm the user has three items from the GDC Data Portal:

  • gdc_sample_sheet.<date>.tsv — the sample sheet export
  • Decompressed gdc_download_xxxxx/ directory with expression archives
  • clinical.cart.<date>/ directory with clinical XML/JSON files

2. Initialise the TCGA helper

import omicverse as ov
import scanpy as sc
ov.plot_set()
Installs
2
First Seen
Mar 20, 2026