desktop-control

Installation
SKILL.md

Desktop-Control Driver

The orchestrator routed you here. Use these mechanics to execute your plan.

Drive native desktop GUI apps through upstream trycua/cua cua-driver: enumerate apps and windows, snapshot accessibility trees, click/type/scroll by element_index or pixel coordinates, and verify by re-snapshot -- all without bringing the target to the foreground.

When to use

  • Automating a native desktop app (Finder, Notepad, System Settings, native editors)
  • Driving native dialogs and security/permission sheets that no DOM or PTY can reach
  • Visual QA of native UI: per-window screenshots, accessibility-tree assertions

If the target is a terminal TUI, use tuistory or true-input. If it is a web page or an Electron app, use agent-browser -- CDP beats accessibility trees for anything Chromium-based.

Platform support

Installs
4
GitHub Stars
92
First Seen
12 days ago
desktop-control — factory-ai/factory-plugins