Midscene Browser Automation

Installation
SKILL.md

Midscene Browser Automation

Automate browser interactions using Midscene with Claude. This skill provides natural language control over a Chrome browser through command-line tools for navigation, interaction, data extraction, and screenshots.

Overview

This skill uses a CLI-based approach where Claude calls browser automation commands via bash. The browser stays open between commands for faster sequential operations and preserves browser state (cookies, sessions, etc.).

Key Features:

  • 🧠 Natural language understanding of page elements
  • 🎯 Intelligent element identification without CSS selectors
  • 👁️ Visual and semantic understanding of web pages
  • 🤖 AI-powered interactions and data extraction

Setup Verification

IMPORTANT: Before using any browser commands, you MUST check setup.json in this directory.

First-Time Setup Check

Related skills
Installs
GitHub Stars
221
First Seen