agent-browser

Installation
Summary

Playwright-based browser automation with element refs for AI agents, supporting navigation, interaction, screenshots, and video recording.

  • Provides 6 core functions: open (navigate with config), snapshot (refresh element refs), interact (click/fill/drag/upload/scroll), screenshot, execute (JavaScript), and close
  • Element interaction uses simple @e ref system that invalidates after navigation, requiring re-snapshot calls to maintain accurate selectors
  • Supports video recording with optional cursor indicator, proxy routing, file uploads, drag-and-drop, and custom JavaScript execution
  • Session-based workflow persists browser state across sequential commands; always start with --session new and reuse the returned session ID
SKILL.md

Agentic Browser

Browser automation for AI agents via inference.sh. Uses Playwright under the hood with a simple @e ref system for element interaction.

Agentic Browser

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Open a page and get interactive elements
belt app run agent-browser --function open --input '{"url": "https://example.com"}' --session new

Core Workflow

Related skills

More from inferen-sh/skills

Installs
GitHub Stars
500
First Seen
agent-browser — inferen-sh/skills