paper-autoraters

Installation

SKILL.md

Paper Autoraters (App. F.3)

Faithful implementation of the four LLM-as-judge autoraters used in PaperOrchestra (Song et al., 2026, arXiv:2604.05018, §5 and App. F.3).

These are the metrics the paper uses to demonstrate that PaperOrchestra beats single-agent and AI-Scientist-v2 baselines. Use them to:

Score a generated paper against a ground-truth paper.
Compare two paper-writing pipelines side-by-side.
Validate your own host-agent execution of the paper-orchestra pipeline.

The four autoraters

Installs

35

Repository

ar9av/paperorchestra

GitHub Stars

603

First Seen

Apr 14, 2026

Security Audits

Gen Agent Trust HubPass

paper-autoraters — ar9av/paperorchestra