Back to AI Lab
ArXiv Paper

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Jianxiong Gao, Zhaoxi Chen, Xian Liu +7December 16, 2025

Summary

Presents LongVie 2, a world-model-style generator for ultra-long videos with explicit control signals. The model can condition on multimodal inputs and maintain temporal coherence over very long horizons, with a public project page for demos. This sits right at the frontier of ‘video world models’ that might eventually underpin simulation-heavy planning and agent training.

Related Content