AGENT DESKTOP
OBSERVE. DECIDE. ACT.
agent-desktop is a native desktop automation CLI designed for AI agents, built with Rust. It gives structured access to any application through OS accessibility trees — no screenshots, no pixel matching, no browser required.
Architecture
Key Features
Native Rust CLI : Fast, single binary, no runtime dependencies
: Fast, single binary, no runtime dependencies C-ABI cdylib ( libagent_desktop_ffi ): Load once from Python / Swift / Go / Ruby / Node / C instead of forking the CLI per call
( ): Load once from Python / Swift / Go / Ruby / Node / C instead of forking the CLI per call 53 commands : Observation, interaction, keyboard, mouse, notifications, clipboard, window management
: Observation, interaction, keyboard, mouse, notifications, clipboard, window management Progressive skeleton traversal : 78–96% token reduction on dense apps via shallow overview + targeted drill-down
: 78–96% token reduction on dense apps via shallow overview + targeted drill-down Snapshot & refs : AI-optimized workflow using deterministic element references ( @e1 , @e2 )
... continue reading