Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
This is a curated list of research on "Embodied AI or robots with Large Language Models". Watch this repository for the latest updates! 🔥
Autonomous Agents (LLMs) research papers. Updated Daily.
awesome grounding: A curated list of research papers in visual grounding
Build your own embodied-intelligence robot from scratch with only basic Python; step through building VLA/OpenVLA/SmolVLA/Pi0 from zero for a deep understanding of embodied AI
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
Democratization of RT-2 "RT-2: New model translates vision and language into action"
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
RAI is a vendor-agnostic agentic framework for Physical AI robotics, utilizing ROS 2 tools for complex actions, defined scenarios, free interface execution, log summaries, voice interaction, and more.
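Since RAI builds on ROS 2, the flavor of integration it automates can be sketched with plain rclpy. This is a minimal sketch, not RAI's actual API: the node, topic name, and message type below are assumptions for illustration only.

```python
import rclpy
from rclpy.node import Node
from std_msgs.msg import String  # standard ROS 2 string message


class CommandBridge(Node):
    """Hypothetical node that forwards agent-generated text commands to a ROS 2 topic."""

    def __init__(self):
        super().__init__("agent_command_bridge")
        # Topic name is an assumption; RAI defines its own interfaces.
        self.pub = self.create_publisher(String, "agent/commands", 10)

    def send(self, text: str) -> None:
        msg = String()
        msg.data = text
        self.pub.publish(msg)


def main():
    rclpy.init()
    node = CommandBridge()
    node.send("navigate to charging dock")
    # A real node would call rclpy.spin(node) to keep processing callbacks.
    node.destroy_node()
    rclpy.shutdown()


if __name__ == "__main__":
    main()
```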
An open source framework for research in Embodied-AI from AI2.
Odyssey: Empowering Minecraft Agents with Open-World Skills
Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges
Seamlessly integrate state-of-the-art transformer models into robotics stacks
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
🧠 A production-ready cognitive foundation for autonomous systems such as OpenClaw and Embodied-AI: memory management from extraction and search to automated optimization, with an API, MCP, CLI, and insights dashboard out of the box.
[arXiv 2023] Embodied Task Planning with Large Language Models
A collection of vision-language-action model post-training methods.
[CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"
A unified, agentic system for general-purpose robots, enabling multi-modal perception, mapping and localization, autonomous mobility and manipulation, and intelligent user interaction.
[IROS'25 Oral & NeurIPS'24 Workshop] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
[NeurIPS'25] TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Teaching Vision-Language Models as Progress Estimators across Embodied Scenarios
[NeurIPS 2024] GenRL: Multimodal-foundation world models ground language and video prompts in embodied domains by turning them into sequences of latent world-model states. These latent sequences can be decoded with the model's decoder, so the expected behavior can be visualized before the agent is trained to execute it.
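The GenRL entry describes a concrete pipeline: embed a prompt into the world model's latent space, roll out a sequence of latent states, decode them for a visual preview, then train the agent on that rollout. A toy sketch of the flow, with every class and method name invented for illustration (the actual GenRL code differs):

```python
import torch


class LatentWorldModel:
    """Toy stand-in for a multimodal-foundation world model (illustrative only)."""

    def __init__(self, latent_dim: int = 32):
        self.latent_dim = latent_dim

    def embed_prompt(self, prompt: str) -> torch.Tensor:
        # Real model: align the text/video prompt with the latent space.
        torch.manual_seed(abs(hash(prompt)) % (2**31))
        return torch.randn(self.latent_dim)

    def rollout(self, z0: torch.Tensor, horizon: int) -> torch.Tensor:
        # Real model: imagine a latent state sequence conditioned on the prompt.
        return torch.stack([z0 + 0.1 * t * torch.randn_like(z0) for t in range(horizon)])

    def decode(self, z_seq: torch.Tensor) -> torch.Tensor:
        # Real model: decode latent states into frames to preview the behavior.
        return z_seq  # placeholder "frames"


model = LatentWorldModel()
z0 = model.embed_prompt("walk to the red door")
latents = model.rollout(z0, horizon=8)  # grounded target behavior
frames = model.decode(latents)          # inspect before training the agent to execute it
print(frames.shape)                     # torch.Size([8, 32])
```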
Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"
Official Repo of LangSuitE
[ICLR 2025 Spotlight] Official PyTorch Implementation of "What Makes a Good Diffusion Planner for Decision Making?"
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
[AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments"
🦾 Set up your embodied LLM agent with the same ease as normal agents in CrewAI or Autogen
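For comparison, the "normal" agent setup in CrewAI that this entry references looks roughly as follows. The embodied framework's own API is not shown here; the role, goal, and task text are invented, and running this assumes an LLM credential (e.g. OPENAI_API_KEY) is configured.

```python
from crewai import Agent, Task, Crew

# A single LLM agent defined declaratively, CrewAI-style.
navigator = Agent(
    role="Navigator",
    goal="Plan a collision-free path to the kitchen",
    backstory="An embodied robot assistant operating indoors.",
)

task = Task(
    description="Move from the living room to the kitchen and report obstacles.",
    expected_output="A step-by-step navigation plan.",
    agent=navigator,
)

crew = Crew(agents=[navigator], tasks=[task])
result = crew.kickoff()  # runs the task with the configured LLM
print(result)
```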