paper-claim-audit

Installation
SKILL.md

Paper Claim Audit: Zero-Context Evidence Verification

🔒 Do not wrap this skill in /loop, /schedule, or CronCreate. It is verdict-bearing — it judges paper-to-evidence fidelity with a deliberately zero-context fresh reviewer. Re-firing that verdict on a wall-clock timer adds no new signal (it changes only when the paper or results change). Schedule the external wait that precedes it — paper draft ready → then audit once. See shared-references/external-cadence.md.

Verify that every claim in the paper matches raw evidence for: $ARGUMENTS

Why This Exists

The executor writes experiments AND writes the paper. It "knows" what the results should be. This creates confirmation bias:

  • Rounding 84.7% up to 85.3%
  • Reporting best seed instead of average
  • Citing metrics from a different experiment config
  • Claiming "improves by 15%" when the delta is actually 12.8%
Installs
154
GitHub Stars
12.7K
First Seen
Apr 14, 2026
paper-claim-audit — wanshuiyin/auto-claude-code-research-in-sleep