agent-browser
Browser Automation with agent-browser
NixOS: All
agent-browsercommands require the Nix prefix:nix run github:numtide/llm-agents.nix#agent-browser -- <command> [args...]Always pass
--headedon the first command to open a visible browser window.
Quick start
agent-browser --headed open <url> # Navigate (visible browser)
agent-browser set viewport 1920 1080 # Set 1080p resolution (always do this after open)
agent-browser snapshot -i # Get interactive elements with refs
agent-browser click @e1 # Click element by ref
agent-browser fill @e2 "text" # Fill input by ref
agent-browser screenshot page.png # Take screenshot
agent-browser close # Close browser
More from markus1189/nixos-config
sourcegraph-search
Search code using Sourcegraph CLI. Use when (re)searching codebases, finding implementation examples, analyzing code patterns
22elfeed
Search the user's Emacs elfeed RSS feed database containing curated feeds from Reddit, blogs, YouTube, GitHub releases, and newsletters. Use when the user asks about articles they've read, mentions RSS feeds or 'something I read', wants to research topics from their curated sources (programming, AI, security, NixOS, Emacs, etc.), or needs to triage unread items.
1telegram
Direct integration with Telegram Bot API for checking inbox messages and sending messages (text, photos, documents) to chats. Triggers when users request 'check Telegram inbox', 'check messages', 'send message to [chat]', or mention Telegram bot interactions. Requires TELEGRAM_BOT_TOKEN environment variable.
1plantuml
Generates diagrams from text using PlantUML. Use when creating sequence diagrams, class diagrams, activity diagrams, C4 architecture diagrams, Gantt charts, mind maps, ER diagrams, or any visual diagram. Triggers on: diagram requests, architecture visualization, UML, flowcharts, system design, data modeling.
1transcribe-audio
Transcribes audio files (mp3, wav, ogg, m4a, flac, webm) using Gemini API via Portkey, saves transcripts as markdown, and supports follow-up analysis. Use when the user asks to transcribe audio, summarize a meeting recording, check a voice note, extract action items from a recording, asks what was discussed in an audio file, or mentions processing audio files in any way.
1humanizing-prose
Guides writing prose that avoids common AI/LLM tells, and reviews existing text for AI tells with concrete reformulation suggestions. Use when the user asks to 'humanize' text, 'de-AI' writing, review text for AI tells, write naturally, avoid sounding like AI, make text sound human, or mentions AI detection concerns.
1