pinchbench

Installation

SKILL.md

PinchBench Benchmark Skill

PinchBench measures how well LLM models perform as the brain of an OpenClaw agent. Results are collected on a public leaderboard at pinchbench.com.

cd <skill_directory>

# Run benchmark with a specific model
uv run benchmark.py --model anthropic/claude-sonnet-4

Installs

Repository

GitHub Stars

1.1K

First Seen

Mar 7, 2026

Security Audits

pinchbench — pinchbench/skill