Repository navigation

#

world-models

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python
2231
11 天前

Mastering Diverse Domains through World Models

Python
2186
11 天前

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python
1871
10 个月前

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python
1224
9 天前

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Python
841
1 年前

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python
760
3 个月前

Dream to Control: Learning Behaviors by Latent Imagination

Python
553
4 年前

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python
516
13 天前

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.

389
9 天前

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python
374
2 个月前
Python
367
6 天前

DayDreamer: World Models for Physical Robot Learning

Jupyter Notebook
352
3 年前

ICCV 2025 | TesserAct: Learning 4D Embodied World Models

Python
334
2 个月前

An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.

Python
293
4 个月前

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

266
1 年前

《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞

HTML
242
10 个月前