Technology
IT之家
系统极客
AASTOCKS
STCN 证券时报网
+2
6 outlets
Monday, June 1, 2026

Qwen3.7-Plus powers Alibaba multimodal agentic AI model

Source: IT之家
Read original|BABA $125.40

TL;DR

AI-Summarizedfrom 6 sources

On June 2 Beijing time, Alibaba’s Qwen team announced Qwen3.7-Plus, a multimodal upgrade to its Qwen3.7 model positioned as a unified vision–language agent foundation. The model is available via Alibaba Cloud’s Bailian platform and Qwen Studio, supporting images, video, screen and web inputs for GUI and CLI automation.

About this summary

This article aggregates reporting from 6 news sources. The TL;DR is AI-generated from original reporting. Race to AGI's analysis provides editorial context on implications for AGI development.

6 sources covering this story|1 company mentioned

Race to AGI Analysis

Qwen3.7-Plus is Alibaba’s clearest statement yet that the next phase of AI will be agentic and multimodal, not just text‑chat. By explicitly framing the model as a “multimodal interactive hybrid agent” base, Qwen is targeting workflows where an AI has to see screens, understand complex visual layouts, reason over them, call tools, and then execute in either GUI or CLI environments. The benchmark tables SysGeek publishes show Qwen3.7-Plus trading blows with or surpassing leading Western models on a swath of visual reasoning and screen‑automation tasks, even where it trails some Opus or GPT variants on pure text coding. ([ithome.com](https://www.ithome.com/0/958/449.htm))

Strategically, this matters because it signals that top Chinese labs are no longer just chasing general benchmarks; they’re optimizing for end‑to‑end automation of real software workflows. That’s exactly the substrate you need for AI agents that can operate entire businesses or complex operations with minimal human oversight. For the West, it’s a reminder that competition is now about specialized agent performance and integration into tooling ecosystems (Claude Code, OpenClaw, Qwen Code, etc.), not just raw LM scores.

In the AGI context, Qwen3.7-Plus pushes the frontier of what’s practical in vision‑heavy, multi‑step tasks, especially in Chinese‑language and domestic enterprise environments. It narrows any remaining gap in multimodal agent capabilities and ensures that if AGI‑like systems emerge from agentic stacks, China will have competitive building blocks.

May advance AGI timeline

Who Should Care

InvestorsResearchersEngineersPolicymakers

Companies Mentioned

Alibaba
Alibaba
Cloud|China
Valuation: $391.2B
BABANYSE$125.40

Coverage Sources

IT之家
系统极客
AASTOCKS
STCN 证券时报网
StockStar
Qwen (product site)
IT之家
IT之家ZH
Read
系统极客
系统极客ZH
Read
AASTOCKS
AASTOCKS
Read
STCN 证券时报网
STCN 证券时报网ZH
Read
StockStar
StockStarZH
Read
Qwen (product site)
Qwen (product site)ZH
Read