Control Modes
Control modes define how AgentOS connects to the target machine. There are two:- Host Mode — AgentOS runs directly on the target as software. It uses OS-level APIs for screenshots, input, and window management. This is the most common setup.
- Companion Mode — AgentOS runs on a separate device and controls the target through hardware (USB, Bluetooth, HDMI capture) or device bridges (ADB, IDB). No software installation on the target is needed.
Runtime Modes
Runtime modes define how AgentOS runs on the machine it’s installed on. There are two:- Standalone — AgentOS runs as a regular process in the user’s session. Install via
pip install askui-agent-os. Best for local development and testing. - OS Service — AgentOS runs as a Windows system service with SYSTEM privileges. Install via the Service. Best for CI/CD, headless VMs, and enterprise deployments.
Capabilities
Capabilities are the building blocks your agents use — screenshots, keyboard input, mouse control, window management, process management, and more. All capabilities are optimized for token cost efficiency, low latency, and cross-device communication. What’s available depends on the combination of control mode and runtime mode:| Host Mode (Standalone) | Host Mode (OS Service) | Companion Mode | |
|---|---|---|---|
| Screenshots | Yes | Yes | Yes (HDMI capture) |
| Keyboard & mouse | Yes | Yes | Yes (USB/Bluetooth HID) |
| Window & process management | Yes | Yes | — |
| RDP resilience | — | Yes | — |
| Logon screen & CTRL+ALT+DEL | — | Yes | — |
| Mobile devices (ADB/IDB) | — | — | Yes |
Administrator Rights (Windows)
AgentOS consists of two components: the AgentOS Service and the Remote Device Controller (internally called the Execution Engine in installer parameters). Each can run elevated (with administrator privileges) or non-elevated (standard user privileges). Elevation determines what your agent can interact with — Windows prevents non-elevated processes from sending input to elevated applications.Session 0 vs Interactive Session: Windows runs services in Session 0 — an isolated, non-interactive session with no desktop. Users log in to Interactive Sessions (Session 1, 2, …) where the desktop and GUI applications live. The AgentOS OS Service runs in Session 0 but attaches to the user’s interactive session to control the desktop. This is why it can keep automating even when no user is logged in or an RDP session disconnects.
Standalone
| Component | Default | Elevated |
|---|---|---|
| AgentOS | Not elevated (recommended) | Run with admin rights |
| Remote Device Controller | Not elevated (recommended) | Run with admin rights |
OS Service
| Component | Default | Configurable |
|---|---|---|
| AgentOS | Elevated (SYSTEM) | — |
| Remote Device Controller | Not elevated (recommended) | SERVICE_EXECUTION_ENGINE_ELEVATED=1 |
How They Fit Together
Think of it as two independent choices:- Control mode — How do I reach the target? Software on the target (Host) or hardware from outside (Companion).
- Runtime mode — How does AgentOS run? As a regular process (Standalone) or as a system service (OS Service).
Control Modes
Host Mode vs Companion Mode.
Runtime Modes
Standalone vs OS Service.
Capabilities
Full capability matrix.