debate
debate
Prompt templates, context assembly rules, and synthesis format for structured multi-round debates between AI tools.
Arguments
Parse from $ARGUMENTS:
- topic: The debate question/topic (required)
- --proposer: Tool for the proposer role (claude, gemini, codex, opencode, copilot)
- --challenger: Tool for the challenger role (must differ from proposer)
- --rounds: Number of back-and-forth rounds (1-5, default: 2)
- --effort: Thinking effort applied to all tool invocations (low, medium, high, max)
- --model-proposer: Specific model for proposer (optional)
- --model-challenger: Specific model for challenger (optional)
Universal Rules
ALL participants (proposer AND challenger) MUST support claims with specific evidence (file path, code pattern, benchmark, or documented behavior). Unsupported claims from either side will be flagged by the other participant and noted in the verdict. This applies to every round.
More from agent-sh/agentsys
web-browse
Browse and interact with web pages headlessly. Use when agent needs to navigate websites, click elements, fill forms, read content, or take screenshots.
11discover-tasks
Use when user asks to \"discover tasks\", \"find next task\", \"prioritize issues\", \"what should I work on\", or \"list open issues\". Discovers and ranks tasks from GitHub, GitLab, local files, and custom sources.
9learn
Research any topic online and create learning guides. Use when user asks to 'learn about', 'research topic', 'create learning guide', 'build knowledge base', or 'study subject'.
9perf-benchmarker
Use when running performance benchmarks, establishing baselines, or validating regressions with sequential runs. Enforces 60s minimum runs (30s only for binary search) and no parallel benchmarks.
9deslop
Use when user wants to clean AI slop from code. Use for cleanup, remove debug statements, find ghost code, repo hygiene.
8perf-baseline-manager
Use when managing perf baselines, consolidating results, or comparing versions. Ensures one baseline JSON per version.
8