evaluate-multimodal

Pass

Audited by Gen Agent Trust Hub on Mar 23, 2026

Risk Level: SAFE

Full Analysis

[SAFE]: The skill provides instructional content and code snippets that adhere to safe practices for evaluating multimodal models.
[EXTERNAL_DOWNLOADS]: The skill references the langwatch Python package and @langwatch/scenario Node.js package, which are legitimate vendor resources from the author.

Audit Metadata

Risk Level

SAFE

Analyzed

Mar 23, 2026, 11:45 PM