Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
-
Updated
Jan 30, 2025 - Jupyter Notebook
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
VLAGen: Automated Data Collection for Generalizing Robotic Policies
A simple and scalable codebase for training and fine-tuning vision-language-action models (VLAs) for generalist robotic manipulation:
Add a description, image, and links to the vision-language-action topic page so that developers can more easily learn about it.
To associate your repository with the vision-language-action topic, visit your repo's landing page and select "manage topics."