Repository navigation

#

gui-agents

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript
11849
18 分钟前

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

630
1 个月前

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python
205
1 个月前

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

Python
25
9 个月前

Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.

22
5 个月前