evaluate-multimodal

Pass

Audited by Gen Agent Trust Hub on Mar 23, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: The skill provides instructional content and code snippets that adhere to safe practices for evaluating multimodal models.
  • [EXTERNAL_DOWNLOADS]: The skill references the langwatch Python package and the @langwatch/scenario Node.js package, both of which are legitimate vendor packages published by the skill's author.
Audit Metadata
  Risk Level: SAFE
  Analyzed: Mar 23, 2026, 11:45 PM