xprof-profiling-analysis
XProf Profiling Analysis
TPU/GPU training performance analysis. Primary workflow uses XProf MCP tools (xprof_*) for live queries; appendix sections provide deep domain knowledge for offline analysis and result interpretation.
Step 0: Connection Check (MUST do first)
Before any analysis, verify that XProf MCP tools are available:
-
Try calling
xprof_list_runs(). -
If it succeeds — skip to Step 1.
-
If the tool is not found — the plugin is not enabled. Tell the user:
XProf MCP tools are not available. Enable the plugin and restart Claude Code:
claude settings set enabledPlugins.xprof-profiling-analysis@primatrix-skills trueStop here — wait for the user to restart before continuing.
More from primatrix/skills
linear
Manage issues, projects & team workflows in Linear. Use when the user wants to read, create or updates tickets in Linear.
13exec-remote
Executes Python scripts, tests, or benchmarks on a provisioned remote cluster (GPU or TPU) using SkyPilot. Use this skill when the user asks to run code on GPU, TPU, or any "remote" cluster.
12session-recorder
Records the complete session content and logs it to a daily work directory with a dynamic filename based on the active CLI agent. Use this for automated progress tracking and documentation.
10lint-fix
Check and fix lint issues for changed Python files. Supports single commit, commit range, and unstaged/staged working tree changes. Use when the user wants to verify or fix lint compliance.
2gke-tpu
Manage GKE-based TPU workloads — create pods/jobs via kubectl, sync code, and run multi-process benchmarks. Use when the user wants to create/manage/run TPU workloads on GKE. Reads config from gke.toml in the current working directory.
1tpu-perf-model
Use when analyzing theoretical TPU v7x performance for a mathematical formula or comparing kernel performance against theoretical bounds. Trigger when the user asks about TPU performance modeling, roofline analysis, data flow optimization, or tiling strategy.
1