browser-automation-agent
Browser Automation with Agent-Browser
Agent-browser is a headless browser automation CLI designed specifically for AI agents. It provides fast browser control with deterministic element selection through accessibility tree snapshots, making it ideal for agent-driven web automation workflows.
When to use
- Use case 1: When the user asks to automate web interactions (fill forms, click buttons, navigate sites)
- Use case 2: When you need to capture screenshots or generate PDFs of web pages
- Use case 3: For web scraping tasks that require JavaScript rendering or complex interactions
- Use case 4: When building automation workflows that need deterministic element references
- Use case 5: For testing web applications with agent-driven scenarios
Required tools / APIs
- No external API required (runs locally)
- agent-browser: Headless browser CLI with Rust/Node.js implementation
- Chromium: Downloaded automatically during installation
Install options:
More from besoeasy/open-skills
news-aggregation
Aggregate and deduplicate recent news from multiple sources into concise topic summaries.
149free-geocoding-and-maps
Geocode addresses and get map data using free OpenStreetMap Nominatim API. Use when: (1) Converting addresses to coordinates, (2) Reverse geocoding coordinates to addresses, (3) Location-based features, or (4) Validating addresses.
63get-crypto-price
Fetch current and historical crypto prices and compute ATH or ATL over common time windows.
49anonymous-file-upload
Upload and host files anonymously using decentralized storage with Originless and IPFS.
47trading-indicators-from-price-data
Compute common trading indicators from OHLCV price data for analysis and strategy development.
46json-and-csv-data-transformation
Transform data between JSON, CSV, and other formats with filtering, mapping, and flattening. Use when: (1) Converting API responses to CSV, (2) Processing data pipelines, (3) Extracting specific fields, or (4) Flattening nested structures.
44