auto-benchmark

Installation
SKILL.md

Auto Benchmark

A continuous, autonomous benchmarking system that monitors the competitive landscape, extracts insights from the latest research, proposes and runs improvement experiments, and keeps your solution ranked #1 — so engineers and researchers can focus on building product rather than running benchmarks manually.

System Overview

The system operates as a closed loop that runs on a schedule (daily, weekly, or on trigger):

┌─────────────────────────────────────────────────────────────────┐
│                      CONTINUOUS LOOP                            │
│                                                                 │
│  [1] Monitor       [2] Ingest         [3] Hypothesize           │
│  Competitors   →   Research       →   Improvements             │
│  & Leaderboards    Papers             from Gap                  │
│       ↑                                    ↓                    │
│  [6] Report    ←   [5] Promote    ←   [4] Experiment            │
│  Stakeholders      Winners            Autonomously              │
└─────────────────────────────────────────────────────────────────┘
Related skills

More from aviskaar/open-org

Installs
2
GitHub Stars
4
First Seen
Mar 18, 2026