vision-language-action

Here are 13 public repositories matching this topic...

showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

agent vision-language-model vision-language-action computer-use gui-agent

Updated May 29, 2025
Python

xiaomi-research / recogdrive

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

reinforcement-learning autonomous-driving diffusion-policy vision-language-action vision-language-models navsim end-to-end-a

Updated Sep 5, 2025
Python

ucla-mobility / AutoVLA

[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

autonomous-driving vision-language-action reinforcement-finetuning grpo

Updated Sep 19, 2025

2toinf / UniAct

[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"

robotics embodied-ai foundation-models vision-language-action universal-actions

Updated Mar 24, 2025
Python

BridgeVLA / BridgeVLA

✨✨ [NeurIPS 2025] Official implementation of BridgeVLA

robotics embodied-ai vision-language-pretraining 3d-manipulation vision-language-action

Updated Sep 20, 2025
Python

TongUI-agent / TongUI-agent

Code, datasets, and model release for TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

agent vision-language-model vision-language-action computer-use gui-agent vision-language-action-model computer-use-agent tongui

Updated Jul 11, 2025
HTML

jiaming-zhou / X-ICM

Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method

manipulation vision-language-action

Updated Sep 7, 2025
Python

worldbench / awesome-vla-for-ad

🌐 A curated collection of vision-language-action (VLA) models for autonomous driving applications

awesome-list autonomous-driving 3d vla vlm embodied-ai large-language-models llm multimodal-large-language-models vision-language-action vision-language-models

Updated Sep 17, 2025
HTML

YuZhaoshu / Efficient-VLA-Survey

🔥 A curated list of research for "A survey on Efficient Vision-Language Action Model". We will continue to maintain and update the repository, so follow us to keep up with the latest developments!

efficient vla embodied-ai vision-language-action vision-language-action-model

Updated Sep 20, 2025

SS47816 / AGI-Elo

AGI-Elo: How Far Are We From Mastering A Task?

benchmark leaderboard agi imagenet coco artificial-general-intelligence datasets evaluation-metrics elo-rating rating-system evaluation-framework sota ai-benchmarks waymo-open-dataset mmlu vision-language-action ai-evaluation-framework livecodebench navsim

Updated May 21, 2025
Python

miladfa7 / PickAgent

PickAgent: OpenVLA-powered Pick and Place Agent | Gradio & Simulation | Vision Language Action Model

ai deep-learning gradio vision-language-model vision-language-action openvla

Updated Aug 10, 2025
Python

pl909 / VLAGen

VLAGen: Automated Data Collection for Generalizing Robotic Policies

robot ai ml vision-language-action

Updated Feb 23, 2025
Python

OmniJarvis / omnijarvis.github.io

Project Page of OmniJARVIS

agent minecraft vision-language-action

Updated Jul 2, 2024
HTML
