Repository navigation

#

world-models

Mastering Diverse Domains through World Models

Python
2066
4 个月前

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python
1921
2 天前

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python
1854
8 个月前

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Python
839
10 个月前

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python
750
2 个月前

Dream to Control: Learning Behaviors by Latent Imagination

Python
548
4 年前

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python
402
2 天前

A curated list of world models for autonomous driving. Keep updated.

360
9 天前

DayDreamer: World Models for Physical Robot Learning

Jupyter Notebook
340
3 年前

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.

332
20 天前

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python
330
1 个月前

ICCV 2025 | TesserAct: Learning 4D Embodied World Models

Python
320
16 天前

An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.

Python
277
2 个月前

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

266
1 年前

《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞

HTML
230
9 个月前

A structured implementation of MuZero

Python
205
3 年前

Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"

Python
202
7 个月前

A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.

157
2 天前