The Agent Skills Directory

[SAFE]: The skill implements a robust 'Pre-flight checklist' and 'Stop conditions' that enforce best practices, such as verifying dependencies and ensuring symbols are fetched from authoritative API skills rather than memory.
[SAFE]: It mandates the use of a dedicated scratch/ directory for any investigation code longer than two lines, which promotes workspace organization and traceability of transient scripts.
[SAFE]: Dependency management is deferred to a specialized python-env-manager skill, preventing the agent from executing unverified or hardcoded installation commands directly.
[SAFE]: Evaluation logic is restricted to specific directory structures (experiments/NN_*.py), preventing accidental pollution of the project's data store and ensuring experiments are reproducible.
[SAFE]: The skill requires mandatory user interaction (AskUserQuestion) for critical decision points like time-series splitter selection, ensuring the user remains in the loop for high-impact configuration changes.

evaluate-ml-pipeline