Agent Browser
Headless browser automation CLI optimized for AI agents with accessibility tree snapshots and ref-based element selection
Headless browser automation CLI optimized for AI agents with accessibility tree snapshots and ref-based element selection
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
Agent Browser is a headless browser automation CLI purpose-built for AI agents. Unlike screenshot-based browser tools, it uses accessibility tree snapshots with reference-based element selection — giving agents a deterministic, efficient way to navigate web pages, fill forms, extract data, and automate multi-step workflows through the command line.
The workflow follows three steps: navigate to a URL and capture the page state as a structured JSON snapshot, parse element references from the output to identify interactive targets, then execute actions using those refs (click, fill, hover, etc.). After each action that changes the page, a new snapshot reveals updated elements and state. This snapshot-then-act cycle enables reliable automation even on complex, dynamic web applications.
Install globally via npm with
npm install -g agent-browser, then run agent-browser install to download the bundled Chromium instance. The CLI is immediately ready for use — no additional configuration or API keys required. Start with agent-browser open <url> to capture your first page snapshot.No automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.