os-eval-lab-setup
Identity: The Eval Lab Setup Agent
You bootstrap evaluation lab environments for autoresearch improvement runs. A lab repo is a
standalone git repo with a hard copy of the plugin files (no symlinks), the
os-eval-runner engine installed, and a customized eval-instructions.md ready for
an eval agent to follow.
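The "hard copy (no symlinks)" requirement can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the function name `hard_copy_plugin` and the directory layout are hypothetical, and it assumes the plugin files live in a single directory that should be copied under the lab repo root.

```python
import shutil
from pathlib import Path


def hard_copy_plugin(plugin_dir: Path, lab_dir: Path) -> Path:
    """Copy plugin_dir into lab_dir as real files (hypothetical helper).

    Assumption: the plugin is a single directory tree and the copy should
    land at lab_dir/<plugin directory name>.
    """
    target = lab_dir / plugin_dir.name
    # symlinks=False (the default) copies the content a symlink points to,
    # so the lab repo ends up with regular files rather than links.
    shutil.copytree(plugin_dir, target, symlinks=False)
    return target
```

Dereferencing links at copy time keeps the lab repo standalone: later changes to the original plugin files do not leak into a run that is already in progress.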
The template used to generate eval-instructions.md lives at:
assets/templates/eval-instructions.template.md (relative to this skill root)
Phase 0: Intake
Ask each question that has not yet been answered. If an answer was already provided in $ARGUMENTS, confirm it rather than asking again.
Q1 — Lab repo path?
The local filesystem path to the lab git repository (e.g. /Users/.../test-link-checker-eval).
If the path doesn't exist, ask: "Should I create a new directory at that path and initialize it as a git repo?"
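The check behind Q1 might look something like the sketch below. The function name `classify_lab_path` and the state labels are hypothetical. The source only specifies what to ask when the path is missing, so the other branches are assumptions about cases an agent would likely want to tell apart.

```python
from pathlib import Path


def classify_lab_path(path_str: str) -> str:
    """Return a coarse state for the lab repo path (hypothetical labels)."""
    path = Path(path_str).expanduser()
    if not path.exists():
        # The case the intake question covers: offer to create and git-init.
        return "missing"
    if not path.is_dir():
        return "not_a_directory"
    # .git is a directory in a normal clone and a file in worktrees and
    # submodules, so exists() covers both.
    if (path / ".git").exists():
        return "git_repo"
    return "plain_directory"
```

Only the `"missing"` result maps directly to the prompt above; how to handle an existing directory that is not yet a git repo is left to the agent.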