run-agent-browser
Browser Automation with agent-browser
Drive the agent-browser CLI as the agent's hands inside a browser: open URLs, snapshot interactive refs, fill forms, click, extract DOM, screenshot, switch tabs, persist sessions, run headed/stealth, or dispatch through providers. This skill owns ad hoc, terminal-driven browser tasks where a human-style operator loop (observe → act → verify) is appropriate.
When to use this skill
Use this skill when:
- the user names
agent-browser,npx agent-browser,@refsnapshots,snapshot -i, or any agent-browser flag (--session-name,--profile,--headed,-p browserbase|browseruse|kernel|ios,--engine lightpanda) - the task is one-off browser automation done by the agent itself — log in, click, fill a form, scrape data, take a screenshot, capture page state
- the workflow needs deterministic DOM-grounded verification (
get url,get text,get value,is visible,diff snapshot) rather than asserted test code - multi-tab, popup, OAuth, or session-isolated flows must reuse one browser context across many commands
- hosted, mobile, geo, or anti-bot pressure requires
--headed, stealth, profiles, or a remote provider through the same CLI - another skill (
convert-url-to-nextjs,extract-saas-design) needs live browser evidence — DOM, screenshots, runtime metadata, asset URLs — captured and handed back
Do NOT use this skill when:
- the deliverable is TypeScript code on
@onkernel/sdkor Kernel Apps → usebuild-kernel-ts-sdk - the deliverable is a rebuilt Next.js project from a captured site → ownership stays with
convert-url-to-nextjs; this skill is only invoked for capture - the deliverable is a SaaS visual-system writeup → ownership stays with
extract-saas-design; this skill is only invoked for browser evidence - the task is static research, DevTools-first profiling, or anything that does not require an active browser context
More from yigitkonur/skills-by-yigitkonur
run-research
Use skill if you are answering one technical research question with current web evidence, Reddit practitioner experience, and source-backed synthesis, optionally fanned out across parallel subagents.
42publish-npm-package
Use skill if you are publishing to npm via GitHub Actions release workflow with trusted publishing, NPM_TOKEN, provenance, semantic-release, changesets, release-please, or fixing npm publish CI.
35init-agent-config
Use skill if you are creating, auditing, or migrating CLAUDE.md, AGENTS.md, or REVIEW.md instruction files, folder-scoped agent guidance, or cross-agent companion entrypoints.
34build-mcp-use-server
Use skill if you are building TypeScript MCP servers with mcp-use/server — server.tool, response helpers, ctx.auth, sessions, transports, widgets, Inspector, deploy.
34build-skills
Use skill if you are creating or substantially revising a Claude skill and need workspace-first evidence, remote comparison, and repo-fit synthesis before writing SKILL.md.
32extract-saas-design
Use skill if you are extracting design tokens (color, typography, spacing, components) from an existing SaaS dashboard URL, codebase, or HTML snapshot into structured documentation, not rebuilding it.
32