Repository navigation
world-models
- Website
- Wikipedia
Mastering Diverse Domains through World Models
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Mastering Atari with Discrete World Models
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving
Dream to Control: Learning Behaviors by Latent Imagination
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
A curated list of world models for autonomous driving. Keep updated.
DayDreamer: World Models for Physical Robot Learning
A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.
[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
World Model based Autonomous Driving Platform in CARLA 🚗
《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞
A structured implementation of MuZero
Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"
A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.