Repository navigation

#

scene-understanding

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python
3833
1 个月前

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1493
1 个月前

Lightweight models for real-time semantic segmentationon PyTorch (include SQNet, LinkNet, SegNet, UNet, ENet, ERFNet, EDANet, ESPNet, ESPNetv2, LEDNet, ESNet, FSSNet, CGNet, DABNet, Fast-SCNN, ContextNet, FPENet, etc.)

Python
976
1 年前

A list of recent papers, libraries and datasets about 3D shape/scene analysis (by topics, updating).

Python
954
2 年前

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

Python
826
4 年前

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Python
745
5 个月前

🔥🔥Official Repository for Multi-Human-Parsing (MHP)🔥🔥

JavaScript
680
5 个月前

[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

Python
598
6 个月前

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Python
454
2 年前

Implementation of CVPR'20 Oral: Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Python
435
1 年前

Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Pyramid Multi-task Transformer for Dense Scene Understanding"

Python
318
1 年前

[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

Jupyter Notebook
314
2 年前

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Python
302
1 年前

[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis

Python
299
5 个月前

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation

Python
264
10 个月前

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

Python
215
1 年前

[ECCV'18] 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation

Python
211
3 年前