57 results for “topic:vision-language-action-model”
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Development
A comprehensive list of resources on robot manipulation, including papers, code, and related websites.
Official code of Motus: A Unified Latent Action World Model
InternRobotics' open platform for building generalized navigation foundation models.
With only basic Python, build your own embodied AI robot from scratch; progressively build VLA/OpenVLA/SmolVLA/Pi0 from zero for a deep understanding of embodied intelligence.
[AAAI 2026] OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
[ICLR 2026] The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
Code for kai0, including training, inference, and data collection.
Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
[CVPR 2026] WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving
LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]
WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)
🔥 The first open-source diffusion vision-language-action model.
Open & Reproducible Research for Tracking VLAs
🔥 A curated list of research for "A Survey on Efficient Vision-Language-Action Models". We will continue to maintain and update the repository, so follow us to keep up with the latest developments!
A collection of vision-language-action model post-training methods.
A comprehensive list of resources on dual-system VLA models, including papers, code, and related websites.
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer
[AAAI 2026] Release of code, datasets, and models for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents
[AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies