world-model
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
A collection of papers on world models for autonomous driving (and robotics).
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
Build, evaluate and train General Multi-Agent Assistance with ease
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
A comprehensive list of papers on the definition of world models and their use in general video generation, embodied AI, and autonomous driving, including papers, code, and related websites.
[ICCV 2025] Aether: Geometric-Aware Unified World Modeling
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
A skill-based platform for ROS 2 with knowledge representation, planning, and reasoning
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
DeepVerse: 4D Autoregressive Video Generation as a World Model
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
[ICML'25] PyTorch implementation of the paper "AdaWorld: Learning Adaptable World Models with Latent Actions".
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
[NeurIPS 2024] Agent Planning with World Knowledge Model
[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO
A general AI agent framework that can be adapted to various tasks and environments.