review-model-performance
Pass
Audited by Gen Agent Trust Hub on Jun 20, 2026
Risk Level: SAFE
Full Analysis
- [COMMAND_EXECUTION]: Uses standard shell commands like
findandlsto identify project configuration files (tile.json) and evaluation scenarios. - [COMMAND_EXECUTION]: Orchestrates the
tesslCLI to manage authentication (tessl whoami), generate test scenarios, and execute model comparison jobs. - [EXTERNAL_DOWNLOADS]: Interacts with remote services via the CLI (
tessl scenario download,tessl eval run) and provides monitoring links to thetessl.iodashboard. - [EXTERNAL_DOWNLOADS]: Suggests expanding functionality by installing the
tessl-labs/eval-improveskill from the platform's official labs repository.
Audit Metadata