gemini-computer-use
Gemini 2.5 Computer Use browser automation with Playwright-based agent loops and safety confirmations.
- Implements a screenshot-to-action cycle: capture screen, send to Gemini, parse function calls, execute in Playwright, return results until task completion or turn limit
- Supports multiple browser options: bundled Chromium (default), Chrome/Edge channels via COMPUTER_USE_BROWSER_CHANNEL, or custom executables like Brave
- Includes a safety confirmation workflow that prompts users before executing risky UI actions flagged by the model
- Provides action exclusion via the --exclude flag and recommends sandboxed profiles or containers for safe operation
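The screenshot-to-action cycle, exclusion list, and safety confirmation described above can be sketched roughly as follows. This is a minimal illustration and not the skill's actual code: the `Action` type, its `needs_confirmation` field, the action names, and the four callables (`capture`, `ask_model`, `execute`, `confirm`) are all hypothetical stand-ins for the Playwright and Gemini calls, so the loop runs without a browser or API key.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Action:
    """Hypothetical stand-in for one function call proposed by the model."""
    name: str
    args: dict = field(default_factory=dict)
    needs_confirmation: bool = False  # assumed marker for a model-flagged risky action


def run_agent_loop(
    capture: Callable[[], bytes],
    ask_model: Callable[[bytes, list], list],
    execute: Callable[[Action], str],
    confirm: Callable[[Action], bool],
    excluded: frozenset = frozenset(),
    max_turns: int = 10,
) -> list:
    """Run capture -> model -> execute until no actions remain or turns run out."""
    history = []
    for _ in range(max_turns):
        screenshot = capture()
        actions = ask_model(screenshot, history)
        if not actions:
            break  # no further actions proposed: treat the task as complete
        for action in actions:
            if action.name in excluded:
                result = "skipped: excluded"
            elif action.needs_confirmation and not confirm(action):
                result = "skipped: denied by user"
            else:
                result = execute(action)
            # Results are fed back to the model on the next turn via history.
            history.append((action.name, result))
    return history


if __name__ == "__main__":
    # Scripted fake model: two turns of actions, then an empty turn to stop.
    turns = iter([
        [Action("click", {"x": 10, "y": 20})],
        [Action("type_text", {"text": "hi"}, needs_confirmation=True), Action("drag")],
        [],
    ])
    log = run_agent_loop(
        capture=lambda: b"",                        # a real loop would take a page screenshot
        ask_model=lambda shot, hist: next(turns),   # a real loop would call the model here
        execute=lambda a: f"ok: {a.name}",          # a real loop would drive the browser
        confirm=lambda a: False,                    # a real loop would prompt the user
        excluded=frozenset({"drag"}),
    )
    print(log)
```

Passing the browser, model, and prompt steps in as callables keeps the control flow testable on its own; in a real setup each one would wrap the corresponding Playwright or Gemini call.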
Gemini Computer Use
Quick start
- Source the env file and set your API key:
    cp env.example env.sh
    $EDITOR env.sh
    source env.sh
- Create a virtual environment and install dependencies: