embodied-agent

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1493
1 month ago

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

530
1 year ago

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python
497
1 year ago

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

463
3 months ago

RAI is a vendor-agnostic agentic framework for robotics, utilizing ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more.

Python
369
13 hours ago

An open source framework for research in Embodied-AI from AI2.

Python
367
4 days ago

Odyssey: Empowering Minecraft Agents with Open-World Skills

Python
326
2 months ago

Brain-Body Co-Design in Embodied Intelligence: Taxonomy, Frontiers, and Challenges

195
2 months ago

[arXiv 2023] Embodied Task Planning with Large Language Models

Python
188
2 years ago

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Python
165
4 months ago

[CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"

Python
104
18 days ago

[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"

Python
94
2 months ago

A collection of vision-language-action model post-training methods.

88
5 days ago

Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

81
1 month ago

[NeurIPS 2024] GenRL: Multimodal-foundation world models ground language and video prompts in embodied domains by turning them into sequences of latent world model states. These latent state sequences can be decoded with the model's decoder, so the expected behavior can be visualized before the agent is trained to execute it.

Python
79
4 months ago