paper-claim-audit
Installation
SKILL.md
Paper Claim Audit: Zero-Context Evidence Verification
🔒 Do not wrap this skill in
/loop,/schedule, orCronCreate. It is verdict-bearing — it judges paper-to-evidence fidelity with a deliberately zero-context fresh reviewer. Re-firing that verdict on a wall-clock timer adds no new signal (it changes only when the paper or results change). Schedule the external wait that precedes it — paper draft ready → then audit once. Seeshared-references/external-cadence.md.
Verify that every claim in the paper matches raw evidence for: $ARGUMENTS
Why This Exists
The executor writes experiments AND writes the paper. It "knows" what the results should be. This creates confirmation bias:
- Rounding 84.7% up to 85.3%
- Reporting best seed instead of average
- Citing metrics from a different experiment config
- Claiming "improves by 15%" when the delta is actually 12.8%