DreamX-World 1.0: General-Purpose Interactive World Model with 6DoF Camera Control

AMAP-ML (Alibaba Maps AI Lab)

Research official 3 src. ~1 min

DreamX-World is a general-purpose interactive world model that generates diverse, high-fidelity worlds from text or image prompts and allows users or agents to explore them via WASD-style 6DoF camera control. Trained on a mix of Unreal Engine data, gameplay footage, and real-world video, it supports 720P generation up to 7.5 seconds per clip and long-horizon rollouts up to one minute. Two variants are released under Apache 2.0: DreamX-World-5B-Cam (bidirectional, 5s) and DreamX-World-5B (autoregressive, long-horizon).

Why it matters

One of the first openly released general-purpose interactive world models capable of responding to fine-grained camera and event controls across indoor, urban, nature, sci-fi, and gaming domains. 264 upvotes on HuggingFace Daily Papers signals strong community interest. Combining RL-based training with geometry-guided memory advances the practicality of world models as simulation environments for downstream agents.

Importance: 3/5

Notable open-source interactive world model release; 264 upvotes on HF Daily Papers (+1 bump applied); advances agent simulation environments.

Sources