#spatial-intelligence

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python
3835
1 month ago

Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python
318
2 months ago

[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

Python
163
2 months ago

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Python
163
2 months ago

[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding

Jupyter Notebook
153
1 month ago

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

Python
132
18 days ago

[CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation".

117
7 months ago

Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"

Python
74
23 days ago

Python
36
12 days ago

This is the official implementation of "LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences"

Python
23
14 days ago

"Gradio" Interface for SpatialLM Model | A 3D Large Language Model for Structured Scene Understanding, Processing Point Cloud Data from Monocular Videos, RGBD Images, and LiDAR.

Python
11
5 months ago

SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.

Python
8
4 months ago

Benchmarking 3D and 4D World Models in the Real World

2
2 months ago

Trying out SpatialLM (SpatialLM: Large Language Model for Spatial Understanding). Impressed with results 💖

Jupyter Notebook
1
5 months ago