confidence
Installation
SKILL.md
Confidence Assessment
Rate your confidence that the current work fully solves the stated requirement.
Multi-signal evaluation. A single LLM-confidence number is unreliable as a stand-alone gate (token probability ≠ correctness). This skill combines the LLM's dimensional scoring with deterministic rule checks the agent must run alongside. The final score is gated on BOTH passing.
Contents
- Mode Detection
- Assessment Dimensions
- For
planmode — multi-signal: LLM scoring + rule checks (89% cap on failure) - For
codemode - For
bug-analysismode
- For
- Output Format
- Score Thresholds
- Iteration Protocol (plan mode)
Related skills