ml-model-evaluation

ML Model Evaluation

Overview

Use this skill to evaluate models with decision-grade evidence, covering both aggregate performance and high-risk segments.
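
As a minimal sketch of what evaluating across aggregate and segment views can look like, the snippet below computes accuracy overall and per segment. The function name `accuracy_by_segment`, the segment labels, and the toy data are hypothetical illustrations, not part of this skill's references or templates, and accuracy stands in for whatever metric the task actually requires.

```python
from collections import defaultdict


def accuracy_by_segment(y_true, y_pred, segments):
    """Return (aggregate_accuracy, {segment: accuracy}).

    Hypothetical helper: accuracy is used only as an example metric.
    """
    if not (len(y_true) == len(y_pred) == len(segments)):
        raise ValueError("inputs must have equal length")
    if not y_true:
        raise ValueError("inputs must be non-empty")

    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, pred, seg in zip(y_true, y_pred, segments):
        totals[seg] += 1
        hits[seg] += int(truth == pred)

    aggregate = sum(hits.values()) / len(y_true)
    per_segment = {seg: hits[seg] / totals[seg] for seg in totals}
    return aggregate, per_segment


# Toy data (assumed for illustration only).
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 1, 0, 0, 0]
segments = ["low_risk", "low_risk", "low_risk", "high_risk", "low_risk", "high_risk"]

aggregate, per_segment = accuracy_by_segment(y_true, y_pred, segments)
print(f"aggregate={aggregate:.2f}", per_segment)
```

In this toy case the aggregate number looks moderate while the `high_risk` segment fails completely, which is the kind of gap a segment-level view is meant to surface.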

Scope Boundaries

  • Use this skill when the task matches the trigger condition stated in the skill's description.
  • Do not use this skill when the primary task falls outside this skill's domain.

Shared References

  • Threshold and segmentation rules:
    • references/threshold-and-segmentation-rules.md

Templates And Assets

  • Evaluation report template:
    • assets/evaluation-report-template.md

Inputs To Gather

  • Dataset splits and baseline/candidate definitions.
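
One way to record these inputs before an evaluation run is a small structure that names the baseline and candidate and holds the split membership. This is a sketch under assumptions: the class `EvaluationInputs`, its fields, the model names, and the overlap check are hypothetical and are not defined by this skill.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvaluationInputs:
    """Hypothetical container for the inputs listed above."""

    baseline: str   # identifier of the baseline model (assumed naming)
    candidate: str  # identifier of the candidate model (assumed naming)
    splits: dict    # split name -> collection of example IDs

    def overlapping_ids(self):
        """Return {(split_a, split_b): shared IDs} for each overlapping pair."""
        names = sorted(self.splits)
        overlaps = {}
        for i, first in enumerate(names):
            for second in names[i + 1:]:
                shared = set(self.splits[first]) & set(self.splits[second])
                if shared:
                    overlaps[(first, second)] = shared
        return overlaps


# Toy inputs (assumed for illustration only).
inputs = EvaluationInputs(
    baseline="model-v1",
    candidate="model-v2",
    splits={"train": [1, 2, 3], "validation": [4, 5], "test": [5, 6]},
)
leaks = inputs.overlapping_ids()
print(leaks)
```

Checking for shared IDs across splits is an added precaution in this sketch; an example that appears in two splits can inflate a baseline-versus-candidate comparison.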

Repository: kentoshimizu/sw-agent-skills
First seen: Feb 28, 2026