Browser & Computer Use
Agents that operate web browsers and computers to automate real-world tasks.
Commercial products
Open-source index
01UI-TARS-desktopThe Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra02skyvernAutomate browser based workflows with AI03page-agentJavaScript in-page GUI agent. Control web interfaces with natural language.04midsceneAI-powered, vision-driven UI automation for every platform.05nanobrowserOpen-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Ope…06Agent-SAgent S: an open agentic framework that uses computers like a human07robotgoRobotGo, Go Native cross-platform RPA, GUI automation, Auto test and Computer use @vcaesar08llm-scraperTurn any webpage into structured data using LLMs09faraFara-7B: An Efficient Agentic Model for Computer Use10TuriX-CUAThis is the official website for TuriX Computer-use-Agent11open-computer-useAI computer use powered by open source LLMs and E2B Desktop Sandbox12RPAUi.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.
13ShowUI[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.14OpenAdaptOpen Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) /…15ghost-osFull computer-use for AI agents. Self-learning workflows. Native macOS. No screenshots required.16parchiYour AI friend right in your browser17openbrowser-aiOpenBrowser is a framework for intelligent browser automation. It combines direct CDP communication with a CodeAgent architecture, where th…18BrowserPilotOpen‑source alternative to Perplexity Comet, director.ai and firecrawl combined