run-evals

Skill: Run Evals

Run the OpenWork UI evaluation flows against a real Electron app, either on a Daytona cloud sandbox or on a local instance.

When to use

  • User says "run evals on Daytona" or "run this flow on Daytona"
  • User wants to verify a UI change end-to-end
  • User wants to test the onboarding, session, or settings flows

Prerequisites

  • The daytona CLI is installed and you are logged in (daytona login)
  • The "Different AI" organization is selected (daytona organization use "Different AI")
  • The .devcontainer/ files exist in the repo
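
The local parts of this checklist can be verified before starting. The following is a minimal preflight sketch, assuming bash; check_prereqs is a hypothetical helper name, not part of the skill. It only checks that the daytona binary is on PATH and that a .devcontainer/ directory exists in the given repo path. It does not verify login state or the active organization.

```shell
#!/usr/bin/env bash
# Preflight sketch. check_prereqs is a hypothetical helper, not part of the skill.
# Usage: check_prereqs [repo_dir]   (repo_dir defaults to the current directory)
check_prereqs() {
  local repo_dir="${1:-.}"
  local status=0

  # The daytona CLI must be available on PATH.
  if ! command -v daytona >/dev/null 2>&1; then
    echo "missing: daytona CLI" >&2
    status=1
  fi

  # The repo must contain the .devcontainer/ files.
  if [ ! -d "$repo_dir/.devcontainer" ]; then
    echo "missing: $repo_dir/.devcontainer/" >&2
    status=1
  fi

  return "$status"
}
```

Run check_prereqs from the repo root; a non-zero exit status means at least one item is missing. Login and organization selection still need to be done by hand with daytona login and daytona organization use "Different AI".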

Workflow

Step 1: Create sandbox (if not running)
