The Agent Skills Directory

[EXTERNAL_DOWNLOADS]: The skill instructs the user to install the ag2 Python package with openai and tracing extras using pip install. This is a standard requirement for accessing the evaluation tools and features described in the documentation.
[COMMAND_EXECUTION]: Provides code examples for running agent evaluations and pairwise comparisons. These commands use the official autogen.beta.eval modules and are used to facilitate benchmarking and leaderboard generation as intended by the skill.

ag2-eval-comparison