confidence

Installation

SKILL.md

Confidence Assessment

Rate your confidence that the current work fully solves the stated requirement.

Multi-signal evaluation. A single LLM-confidence number is unreliable as a stand-alone gate (token probability ≠ correctness). This skill combines the LLM's dimensional scoring with deterministic rule checks the agent must run alongside. The final score is gated on BOTH passing.

Contents

Mode Detection
Assessment Dimensions
- For plan mode — multi-signal: LLM scoring + rule checks (89% cap on failure)
- For code mode
- For bug-analysis mode
Output Format
Score Thresholds
Iteration Protocol (plan mode)

Related skills

More from mthines/agent-skills

Installs

11

Repository

mthines/agent-skills

First Seen

Apr 23, 2026

Security Audits

Gen Agent Trust HubPass