skyvern-browser-automation

Installation
SKILL.md

Skyvern Browser Automation -- CLI Judgment Procedure

Skyvern uses AI to navigate and interact with websites. Every command below is a runnable skyvern <command> invocation.

When to Use This Skill

  • Use when you need AI-assisted browser automation for navigation, extraction, form filling, login flows, or reusable website workflows.
  • Use when deterministic selectors are unavailable and Skyvern's visual/a11y reasoning can identify page controls.
  • Use when a one-off browser task should become a repeatable workflow with run history and verification.

Step 1: Classify Your Task (ALWAYS do this first)

Classification Signal CLI Command Cost What Happens
Quick check (yes/no) "is the user logged in?" skyvern browser validate 1 LLM + screenshots Lightweight validation (2 steps max), returns boolean. Cheapest AI option.
Quick inspection "what does the page show?" skyvern browser extract 1 LLM + screenshots Dedicated extraction LLM + schema validation + caching.
Single action (known target) "click #submit" skyvern browser click/type 0 LLM Deterministic Playwright. No AI. Fastest.
Single action (unknown target) "click the submit button" skyvern browser act 2-3 LLM, no screenshots No screenshots in reasoning. Economy a11y tree. For visual targets, use hybrid mode (selector + intent).
Same-page multi-step "fill the form and submit" skyvern browser act or primitive chain 2-3 LLM or 0 LLM Use act when labels are clear. Use click/type/select directly when you know selectors.
Related skills
Installs
4
GitHub Stars
37.3K
First Seen
1 day ago