interacting-with-android-device
Installation
SKILL.md
Interacting with Android Device
Quick Start: MCP Tools
The android-device MCP server provides 6 tools for device interaction. These replace the previous shell scripts with proper XML parsing, structured dumpsys parsing, and native obstruction detection.
Available tools:
capture— Capture UI hierarchy XML and/or screenshot. Params:{ xml?: boolean, screenshot?: boolean }. Default: both.find_element— Find element bytextorcontent-desc, return coordinates with obstruction detection. Params:{ text: string }. Returns JSON with coordinates, bounds, and obstruction status.tap_at— Tap at specific coordinates, wait, capture screenshot. Params:{ x, y, waitSeconds? }.tap_element— Find, tap, and capture in one call (recommended). Params:{ text, waitSeconds? }. Auto-adjusts coordinates when obstructed.navigate— Navigation actions: home, back, app-drawer. Params:{ action, waitSeconds? }. Captures screenshot after action.input_text— Type text into the focused field. Params:{ text, clear? }. Setclear: trueto erase existing content first.
Use these MCP tools instead of raw ADB commands to save tokens, get structured results, and benefit from automatic obstruction detection.