agent-browser

Summary

Playwright-based browser automation with element refs and session persistence for AI agents.

  • Provides 6 core functions: open (navigate + configure), snapshot (refresh element refs), interact (click/fill/drag/upload/scroll), screenshot, execute (JavaScript), and close
  • Uses @e ref system for element targeting; refs invalidate after navigation and require re-snapshot to refresh
  • Supports video recording with optional cursor indicator, proxy routing, file uploads, and drag-and-drop interactions
  • Session-based workflow: open once with --session new, reuse session ID for subsequent commands, close to retrieve video or finalize
SKILL.md

Agentic Browser

Browser automation for AI agents via inference.sh. Uses Playwright under the hood with a simple @e ref system for element interaction.

Quick Start

Requires the inference.sh CLI (belt); see its install instructions.

belt login

# Open a page and get interactive elements
belt app run agent-browser --function open --input '{"url": "https://example.com"}' --session new
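
The session-based workflow described in the summary (open once with --session new, reuse the session ID, then close) could be sketched roughly as follows. This is a dry-run sketch, not the skill's documented usage: belt_cmd is a hypothetical helper that only prints commands, YOUR_SESSION_ID is a placeholder for whatever ID the open call reports, and passing an empty '{}' input to snapshot and close is an assumption, since their input fields are not shown here.

```shell
# Hypothetical helper: print (do not run) a belt command for agent-browser.
# $1 = function name, $2 = JSON input, $3 = session ("new" or an existing ID)
belt_cmd() {
  printf "belt app run agent-browser --function %s --input '%s' --session %s\n" "$1" "$2" "$3"
}

# Placeholder: substitute the session ID reported by the open call.
SESSION_ID="YOUR_SESSION_ID"

# 1. Open the page in a new session (same command as the Quick Start).
belt_cmd open '{"url": "https://example.com"}' new

# 2. Refresh element refs in the same session.
#    Assumption: snapshot accepts an empty input object.
belt_cmd snapshot '{}' "$SESSION_ID"

# 3. Close the session to finalize it.
#    Assumption: close accepts an empty input object.
belt_cmd close '{}' "$SESSION_ID"
```

Printing the commands first makes it easy to check the session ID before anything is sent; once the real input fields are confirmed, the printed lines can be run directly.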

Core Workflow

Installs: 663
GitHub Stars: 432
First Seen: Feb 5, 2026