AI tools are changing how we use software — but they still cover only a fraction of our work. Our daily tasks are scattered across browsers, desktop apps, terminals, and messaging tools — each with its own interface and habits, disconnected from each other.
Understudy is a teachable desktop agent. It operates your computer like a human colleague — GUI, browser, shell, file system, all in one local runtime. You show it a task once, it extracts the intent (not just the coordinates), remembers the successful path, discovers faster execution routes over time, and eventually handles routine work on its own. No API integrations required. No workflow builders. Just demonstrate once.
The Five Layers
Understudy is designed as a layered progression — the same journey a new hire takes when they grow into a reliable colleague.
Day 1: Watches how things are done Week 1: Imitates the process, asks questions Month 1: Remembers the routine, does it independently Month 3: Finds shortcuts and better ways Month 6: Anticipates needs, acts proactively
That's why it's called Understudy — in theater, an understudy watches the lead, learns the role, and steps in when needed.
Each of the five layers maps to a stage of this journey:
Layer 1 ┃ Operate Software Natively Operate any app a human can — see, click, type, verify ────────╋────────────────────────────────────────────────────────────────────────────────── Layer 2 ┃ Learn from Demonstrations User shows a task once — agent extracts intent, validates, learns ────────╋────────────────────────────────────────────────────────────────────────────────── Layer 3 ┃ Crystallized Memory Agent accumulates experience from daily use, hardens successful paths ────────╋────────────────────────────────────────────────────────────────────────────────── Layer 4 ┃ Route Optimization Automatically discover and upgrade to faster execution routes ────────╋────────────────────────────────────────────────────────────────────────────────── Layer 5 ┃ Proactive Autonomy Notice and act in its own workspace, without disrupting the user
Current status: Layers 1-2 are implemented and usable today. Layers 3-4 are partially implemented. Layer 5 is still the long-term direction.
Every layer depends on the one below it. No shortcuts — the system earns its way up. Read the full story: Overview → | Chinese Overview → | Product Design →
... continue reading