profile-kernel

Installation
SKILL.md

GPU Kernel Profiling

GPU Clock Protocol

Always lock clocks before profiling. Always run one profiling at a time.

bash scripts/lock-gpu-clock.sh           # before profiling
bash scripts/reset-gpu-clock.sh          # after done

Device Peaks

Shared profiling guidance should not hard-code a single device. Read the active device context from params.json, runtime profiling artifacts, and the configured peak-spec sources in src/mla_var3/conf/devices.json.

Use device-specific reference files in docs/devices/ only when the measurement context matters. Ridge point remains:

Related skills
Installs
35
First Seen
Apr 21, 2026