On June 2 Beijing time, Alibaba’s Qwen team announced Qwen3.7-Plus, a multimodal upgrade to its Qwen3.7 model positioned as a unified vision–language agent foundation. The model is available via Alibaba Cloud’s Bailian platform and Qwen Studio, supporting images, video, screen and web inputs for GUI and CLI automation.
This article aggregates reporting from 6 news sources. The TL;DR is AI-generated from original reporting. Race to AGI's analysis provides editorial context on implications for AGI development.
Qwen3.7-Plus is Alibaba’s clearest statement yet that the next phase of AI will be agentic and multimodal, not just text‑chat. By explicitly framing the model as a “multimodal interactive hybrid agent” base, Qwen is targeting workflows where an AI has to see screens, understand complex visual layouts, reason over them, call tools, and then execute in either GUI or CLI environments. The benchmark tables SysGeek publishes show Qwen3.7-Plus trading blows with or surpassing leading Western models on a swath of visual reasoning and screen‑automation tasks, even where it trails some Opus or GPT variants on pure text coding. ([ithome.com](https://www.ithome.com/0/958/449.htm))
Strategically, this matters because it signals that top Chinese labs are no longer just chasing general benchmarks; they’re optimizing for end‑to‑end automation of real software workflows. That’s exactly the substrate you need for AI agents that can operate entire businesses or complex operations with minimal human oversight. For the West, it’s a reminder that competition is now about specialized agent performance and integration into tooling ecosystems (Claude Code, OpenClaw, Qwen Code, etc.), not just raw LM scores.
In the AGI context, Qwen3.7-Plus pushes the frontier of what’s practical in vision‑heavy, multi‑step tasks, especially in Chinese‑language and domestic enterprise environments. It narrows any remaining gap in multimodal agent capabilities and ensures that if AGI‑like systems emerge from agentic stacks, China will have competitive building blocks.

