UI-TARS-desktop
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
About UI-TARS-desktop
TARS \* is a Multimodal AI Agent stack, currently shipping two projects: Agent TARS and UI-TARS-desktop:
Agent TARS is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product.
It primarily ships with a CLI and Web UI for usage. It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world MCP tools.
UI-TARS-desktop is an open-source project written primarily in TypeScript, with 38k stars on GitHub. It was last updated in July 2026.
npx @agent-tars/cli@latestUI-TARS-desktop vs. the alternatives
All browser & computer use →| Agent | Stars | Pricing | ||
|---|---|---|---|---|
| UI-TARS-desktop | 38k | TypeScript | Apache-2.0 | Open source |
| skyvern | 22k | Python | AGPL-3.0 | Open source |
| page-agent | 22k | TypeScript | MIT | Open source |
| midscene | 14k | TypeScript | MIT | Open source |
| nanobrowser | 13k | TypeScript | Apache-2.0 | Open source |
| Agent-S | 12k | Python | Apache-2.0 | Open source |
