Hackathon Judge — four-pass project scoring

Score a project submission with the methodology from "Don't Let the LLM Pick a Number": independent passes collect bounded signed evidence; math computes the scores; a final pass turns the evidence into mentoring feedback for the team.
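
To make "evidence in, math out" concrete, here is a minimal sketch under stated assumptions. The `Evidence` type, the [-2, +2] weight bound, the neutral baseline of 5, and the 1 to 10 scale are all hypothetical illustrations and not necessarily the formula this skill uses. The point it shows is that the model only records signed observations, and plain arithmetic produces the number.

```python
from dataclasses import dataclass

# Hypothetical bounds and scale, chosen only for illustration.
MAX_WEIGHT = 2          # each evidence item carries a signed weight in [-2, +2]
BASELINE = 5.0          # neutral starting score for every criterion
LOW, HIGH = 1.0, 10.0   # final scores are clamped to this range


@dataclass(frozen=True)
class Evidence:
    criterion: str
    weight: int   # positive supports the criterion, negative detracts
    note: str     # what the pass actually observed


def score(evidence: list[Evidence]) -> dict[str, float]:
    """Compute per-criterion scores arithmetically from signed evidence."""
    totals: dict[str, float] = {}
    for item in evidence:
        if abs(item.weight) > MAX_WEIGHT:
            raise ValueError(f"weight out of bounds: {item.weight}")
        totals[item.criterion] = totals.get(item.criterion, 0.0) + item.weight
    # Baseline plus net evidence, clamped to the scale.
    return {c: min(HIGH, max(LOW, BASELINE + t)) for c, t in totals.items()}


# Made-up evidence items for a made-up submission.
items = [
    Evidence("technical depth", +2, "custom retry logic with tests"),
    Evidence("technical depth", -1, "no error handling on upload path"),
    Evidence("polish", +1, "clear README"),
]
scores = score(items)
```

Because every item is bounded, no single observation can dominate a criterion, and rerunning `score` on the same evidence always yields the same result.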

This skill is generic by design — there is nothing hackathon-specific in the math. "Hackathon" is the natural search term, but the same pipeline works for take-home interviews, demo-day judging, internal review boards, vendor pilots, and any project where someone built a thing and someone else needs to score it.

When to use this

A submission has a codebase and, optionally, a demo video (URL or transcript). The four-pass shape gets the most value when both exist, but the skill degrades gracefully if only code is available.
The user wants a defensible numeric score per criterion, not a vibe verdict.
Multiple submissions need to be scored consistently (the formula is reproducible across runs and reviewers).
The user wants the team to receive grounded mentoring feedback, not a leaderboard rank.

If the user just wants quick feedback on a draft (no codebase), use what-works-feedback-judge. If they want to design their own scoring rubric for a non-project domain, use evidence-scoring.

The four passes

Each pass is a separate stage. They can be run together (full pipeline) or in isolation when iterating.
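
One way to support both modes is a registry of stage functions that share a results dictionary. The sketch below is illustrative only: the stage names `pass_1` through `pass_4`, the `stub` factory, and the `run` helper are placeholders introduced here, and the stage bodies do no real work.

```python
from typing import Callable, Optional

Results = dict[str, str]
Stage = Callable[[Results], str]


def stub(name: str) -> Stage:
    """Build a placeholder stage that reports which earlier results it saw."""
    def stage(results: Results) -> str:
        return f"{name} saw {sorted(results)}"
    return stage


# Placeholder registry; insertion order is execution order.
STAGES: dict[str, Stage] = {
    name: stub(name) for name in ("pass_1", "pass_2", "pass_3", "pass_4")
}


def run(only: Optional[list[str]] = None,
        cached: Optional[Results] = None) -> Results:
    """Run every stage, or just the ones in `only`, reusing cached results."""
    results: Results = dict(cached or {})
    for name, stage in STAGES.items():
        if only is None or name in only:
            results[name] = stage(results)
    return results


full = run()                                # full pipeline
rerun = run(only=["pass_4"], cached=full)   # iterate on one stage in isolation
```

Keeping earlier outputs in `cached` means a single stage can be rerun while iterating without repeating the others.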
