benchmark-due-diligence
Benchmark Due Diligence
Take a benchmark the user envies — a founder, KOL, company, or product whose success looks suspiciously shiny — and produce a teardown that ends in "what this means for ME", not a neutral report. The deliverable answers three questions a balanced briefing never does: How much of this success is real vs marketing bubble? How much is replicable method vs luck/timing? And what, specifically, can the commissioner do with it?
This is the adversarial, decision-oriented cousin of deep-research. Where deep-research builds a trustworthy picture of the world, this skill assumes the picture is inflated until proven otherwise and converts the survivors into the commissioner's own moves.
CRITICAL: run inline, never context: fork
This skill is an orchestrator — it spawns parallel collection + verification agents (via the Workflow tool, or Task agents) and may invoke other skills (deep-research, osint-investigate, qcc). Subagents cannot spawn subagents or call skills. Setting context: fork would silently break the entire fan-out. Do not add a context field. (Same constraint osint-investigate documents — it's a hard runtime rule, not a preference.)
The one rule that protects the commissioner: two injection channels
Everything the agents see flows through exactly two channels. Keeping them separate is the single most important discipline in this skill:
| Channel | Content | Injected into |
|---|---|---|
| FACTS | Already-verified public facts about the benchmark (relationships, who-owns-what, the headline claim flagged ⚠️ to-verify) |
Every agent — collection, verification, synthesis |
| COMMISSIONER_CONTEXT | The commissioner's private reality — real resources, client names, strategic intent, what they can actually leverage | Only the final mapping agent (Phase 4) |