Paper Claim Audit: Zero-Context Evidence Verification

🔒 Do not wrap this skill in /loop, /schedule, or CronCreate. It is verdict-bearing — it judges paper-to-evidence fidelity with a deliberately zero-context fresh reviewer. Re-firing that verdict on a wall-clock timer adds no new signal (it changes only when the paper or results change). Schedule the external wait that precedes it — paper draft ready → then audit once. See shared-references/external-cadence.md.

Verify that every claim in the paper matches raw evidence for: $ARGUMENTS

Why This Exists

The executor writes experiments AND writes the paper. It "knows" what the results should be. This creates confirmation bias:

Rounding 84.7% up to 85.3%
Reporting best seed instead of average
Citing metrics from a different experiment config
Claiming "improves by 15%" when the delta is actually 12.8%

paper-claim-audit

Paper Claim Audit: Zero-Context Evidence Verification

Why This Exists