benchmark-due-diligence

Benchmark Due Diligence

Take a benchmark the user envies — a founder, KOL, company, or product whose success looks suspiciously shiny — and produce a teardown that ends in "what this means for ME", not a neutral report. The deliverable answers three questions a balanced briefing never does: How much of this success is real vs marketing bubble? How much is replicable method vs luck/timing? And what, specifically, can the commissioner do with it?

This is the adversarial, decision-oriented cousin of deep-research. Where deep-research builds a trustworthy picture of the world, this skill assumes the picture is inflated until proven otherwise, then converts the claims that survive scrutiny into moves the commissioner can actually make.

CRITICAL: run inline, never context: fork

This skill is an orchestrator — it spawns parallel collection + verification agents (via the Workflow tool, or Task agents) and may invoke other skills (deep-research, osint-investigate, qcc). Subagents cannot spawn subagents or call skills. Setting context: fork would silently break the entire fan-out. Do not add a context field. (Same constraint osint-investigate documents — it's a hard runtime rule, not a preference.)
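
The "no context field" rule can be guarded mechanically. The following is a minimal sketch, assuming SKILL.md opens with a `---`-delimited frontmatter block of `key: value` lines; the helper names `frontmatter_keys` and `assert_runs_inline` are hypothetical and not part of the skill.

```python
def frontmatter_keys(skill_md: str) -> list[str]:
    """Return the top-level keys of a leading '---'-delimited frontmatter block.

    Assumption: frontmatter is a simple list of `key: value` lines. This is a
    line scanner for illustration, not a full YAML parser.
    """
    lines = skill_md.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no frontmatter at all
    keys = []
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of the frontmatter block
        # Top-level keys start in column 0; skip comments and nested lines.
        if line and not line[0].isspace() and not line.startswith("#") and ":" in line:
            keys.append(line.split(":", 1)[0].strip())
    return keys


def assert_runs_inline(skill_md: str) -> None:
    """Raise if the skill declares any context field in its frontmatter."""
    if "context" in frontmatter_keys(skill_md):
        raise ValueError("SKILL.md sets a context field; the orchestrator must run inline")
```

A check like this only inspects the frontmatter, so a mention of `context: fork` in the body text (as in this section) does not trigger it.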

The one rule that protects the commissioner: two injection channels

Everything the agents see flows through exactly two channels. Keeping them separate is the single most important discipline in this skill:

| Channel | Content | Injected into |
|---|---|---|
| FACTS | Already-verified public facts about the benchmark (relationships, who-owns-what, the headline claim flagged ⚠️ to-verify) | Every agent: collection, verification, synthesis |
| COMMISSIONER_CONTEXT | The commissioner's private reality: real resources, client names, strategic intent, what they can actually leverage | Only the final mapping agent (Phase 4) |
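
One way to keep the two channels separate is to route every prompt through a single builder that decides what each role may see. The sketch below is illustrative only: the role strings, the `Channels` dataclass, and `build_prompt` are hypothetical names, and it assumes the Phase 4 mapping agent receives FACTS as well, since FACTS goes to every agent.

```python
from dataclasses import dataclass

# Hypothetical role names, taken from the agent types the text mentions.
PUBLIC_ROLES = {"collection", "verification", "synthesis"}
MAPPING_ROLE = "mapping"  # the final mapping agent (Phase 4)


@dataclass(frozen=True)
class Channels:
    facts: str                 # verified public facts about the benchmark
    commissioner_context: str  # the commissioner's private reality


def build_prompt(role: str, task: str, channels: Channels) -> str:
    """Assemble an agent prompt, injecting only the channels this role may see."""
    if role not in PUBLIC_ROLES and role != MAPPING_ROLE:
        # Fail closed: an unrecognized role never receives either channel.
        raise ValueError(f"unknown agent role: {role!r}")
    sections = [f"## TASK\n{task}", f"## FACTS\n{channels.facts}"]
    if role == MAPPING_ROLE:
        # COMMISSIONER_CONTEXT is appended for the mapping agent only.
        sections.append(f"## COMMISSIONER_CONTEXT\n{channels.commissioner_context}")
    return "\n\n".join(sections)
```

Because the private channel is added in exactly one branch, a collection or verification prompt cannot contain commissioner data unless that branch is changed.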
benchmark-due-diligence — daymade/claude-code-skills