browser-use

Originally fromgregpr07/browser-use
Installation
Summary

Fast, persistent browser automation with session continuity across sequential agent commands.

  • Supports three browser modes: headless Chromium, real Chrome with profile support, and cloud-hosted remote browsers with proxy configuration
  • Includes 20+ command categories covering navigation, page inspection, interactions, data extraction, cookie management, JavaScript execution, and wait conditions
  • Offers cloud session management, local server tunneling via Cloudflare, and parallel subagent execution through remote sessions with task monitoring
  • Built-in Python integration for setting variables, accessing the browser object, and running scripts within the automation context
SKILL.md

Browser Automation with browser-use CLI

The browser-use command provides fast, persistent browser automation. A background daemon keeps the browser open across commands, giving ~50ms latency per call.

Prerequisites

browser-use doctor    # Verify installation

For setup details, see https://github.com/browser-use/browser-use/blob/main/browser_use/skill_cli/README.md

Core Workflow

  1. Navigate: browser-use open <url> — launches headless browser and opens page
  2. Inspect: browser-use state — returns clickable elements with indices
  3. Interact: use indices from state (browser-use click 5, browser-use input 3 "text")
  4. Verify: browser-use state or browser-use screenshot to confirm
  5. Repeat: browser stays open between commands
Related skills
Installs
80.8K
GitHub Stars
100.2K
First Seen
Jan 22, 2026
browser-use — browser-use/browser-use