browser-automation
Browser Automation
Available Tools
- browser_act(instruction, starting_url?): Execute browser actions using natural language (click, type, scroll, select). Use `starting_url` to navigate to a page and act in a single call.
- browser_get_page_info(url?, text?, tables?, links?): Get page structure and DOM data (fast, no AI). Use `url` to navigate first; `text=True` for full text, `tables=True` for table data, `links=True` for all links.
- browser_manage_tabs(action, tab_index?, url?): Switch, close, or create browser tabs
- browser_save_screenshot(filename): Save current page screenshot to workspace
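The tool signatures above can be pictured as a short sequence of calls. The following is a minimal sketch, assuming each call is represented as a tool name plus keyword arguments; the `tool_call` helper, the dictionary shape, the example URL, and the filename are all hypothetical and are not part of the skill itself.

```python
from typing import Any


def tool_call(name: str, **kwargs: Any) -> dict[str, Any]:
    """Hypothetical helper: pair a tool name with its arguments.

    Optional parameters left as None are omitted, mirroring the
    `param?` notation used in the tool list.
    """
    args = {key: value for key, value in kwargs.items() if value is not None}
    return {"tool": name, "args": args}


# Illustrative plan: read the page structure first (fast, no AI),
# then act on the page, then capture the result.
plan = [
    tool_call("browser_get_page_info", url="https://example.com/pricing", tables=True),
    tool_call("browser_act", instruction="Click the 'Annual billing' toggle"),
    tool_call("browser_save_screenshot", filename="pricing_annual.png"),
]

for step in plan:
    print(step["tool"], step["args"])
```

Starting with browser_get_page_info is a design choice in this sketch: it is described as the fast, non-AI call, so it is a reasonable first step before a natural-language action.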
When to Use
Use browser automation when the task genuinely requires it:
- UI interactions: Filling forms, clicking buttons, navigating multi-step workflows
- Login-required pages: Accessing content behind authentication that APIs cannot reach
- Dynamic/JS-heavy pages: Content rendered client-side that plain HTTP requests can't capture
- Human-like browsing needed: Sites that block bots or require realistic interaction patterns
- Scraping structured data: When no API exists and the data must be extracted from rendered pages
Prefer web search or url_fetcher for general information lookup, news, or publicly accessible pages — browser automation is slower and heavier. Reserve it for tasks where simpler tools are insufficient.
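The guidance above amounts to a simple checklist. A minimal sketch of that checklist follows, assuming a task can be summarized by five yes/no properties that mirror the bullets; the `Task` class, its field names, and `needs_browser` are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical task summary; each flag mirrors one bullet in the list above."""
    needs_ui_interaction: bool = False      # forms, buttons, multi-step workflows
    requires_login: bool = False            # content behind authentication
    rendered_client_side: bool = False      # dynamic or JS-heavy pages
    needs_humanlike_browsing: bool = False  # sites that block bots
    scrape_without_api: bool = False        # structured data, no API available


def needs_browser(task: Task) -> bool:
    """Return True when at least one browser-only condition applies."""
    return any(
        (
            task.needs_ui_interaction,
            task.requires_login,
            task.rendered_client_side,
            task.needs_humanlike_browsing,
            task.scrape_without_api,
        )
    )


# A public news lookup trips none of the flags, so a simpler tool is preferred.
print(needs_browser(Task()))
# A form submission behind a login trips two flags.
print(needs_browser(Task(needs_ui_interaction=True, requires_login=True)))
```

If no flag is set, web search or url_fetcher remains the preferred choice, as stated above.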