Gui

Research papers, repositories, and articles about gui

Showing 2 of 2 items

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Surveys how teams use reinforcement learning plus GUI interaction to push beyond simple desktop macros into always-on "digital inhabitants". Breaks the space into offline, online, and hybrid strategies, and highlights trends like world-model training and process-level rewards. If you’re automating real GUI workflows, treat this as a roadmap, not just a survey. ([arxiv.org](https://arxiv.org/abs/2604.27955))

Junan Hu, Jian Liu

AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management

Reframes GUI agent interaction history as a program with variables and control flow, using this structure to decide what to retain or discard in context. Combined with a global belief-state mechanism, AgentProg significantly improves long-horizon task success on AndroidWorld and a new benchmark, avoiding the context bloat and semantic loss that plague prior compression schemes. ([arxiv.org](https://arxiv.org/abs/2512.10371?utm_source=openai))

Shizuo Tian, Hao Wen