open-browser-use
Installation
SKILL.md
Open Browser Use
Overview
Open Browser Use connects an MV3 Chrome extension, a local native messaging host, a CLI, SDKs, and an optional stdio MCP server so agents can automate a real Chrome profile. It is not Codex.app-specific; adapt the commands, MCP config, and SDK examples to the agent runtime you are operating in.
Core Workflow
- Check setup with
open-browser-use pingorobu ping. If it fails because setup is missing, read references/installation.md. - Choose a unique browser session id for the current agent task before opening or claiming tabs. Prefer the surrounding runtime's conversation/session id when available; otherwise create a short unique id such as
obu-<task-slug>-<timestamp>. Reuse that same id for every Open Browser Use command in this task. - Name the current browser task group before opening or claiming tabs. Use a short task label followed by
- OBU; if no better task label is available, useTask - OBU. - Use the CLI for simple inspection or one-shot actions:
info,tabs,user-tabs,history,open-tab,navigate,cdp, andcall. - Use
open-browser-use run/obu runfor CLI-level multi-step orchestration when a small line-oriented action plan is enough and writing SDK code would be unnecessary. - If the surrounding agent runtime supports local MCP servers, configure
obu mcpand call the exposed browser tools directly. Use therun_action_planMCP tool for the same line-oriented orchestration from MCP. Read references/sdk-and-protocol.md. - Use the JavaScript, Python, or Go SDK for larger multi-step workflows, event subscriptions, richer control flow, or when the surrounding agent runtime already runs code. Read references/sdk-and-protocol.md.
- Before ending browser work, release or keep session tabs with
open-browser-use finalize-tabs --session-id "$OBU_SESSION_ID" --keep '<json-array>', the MCPfinalize_tabstool, or the SDKfinalizeTabs/finalize_tabs/FinalizeTabsmethod. - If communication fails after setup, read references/troubleshooting.md.