Published January 16, 2026 | Version starVLA-1.2
Software documentation (Open Access)

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

  • 1. HKUST
  • 2. Fudan University
  • 3. SUSTech
  • 4. Xi'an Jiaotong University
  • 5. The Chinese University of Hong Kong
  • 6. Tsinghua University/Microsoft
  • 7. Microsoft
  • 8. Harbin Institute of Technology & Zhongguancun Academy
  • 9. SII

Description

StarVLA is a modular and flexible codebase for developing Vision-Language-Action (VLA) models from Vision-Language Models (VLMs). In StarVLA (also a pun on “start VLA”), each functional component (model, data, trainer, config, evaluation, etc.) follows a top-down, intuitive separation with high cohesion and low coupling, enabling plug-and-play design, rapid prototyping, and independent debugging.
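The plug-and-play idea described above can be sketched with a small component registry: each part (backbone, action head, etc.) is registered under a name and assembled at build time, so swapping one piece does not touch the others. All names here (`ComponentRegistry`, `register`, `build`, and the example components) are illustrative assumptions, not StarVLA's actual API.

```python
# Minimal sketch of a "Lego-like" plug-and-play component registry.
# Hypothetical names for illustration only; see the repository for the real API.

from typing import Any, Callable, Dict


class ComponentRegistry:
    """Maps component names to factory functions so parts can be swapped."""

    def __init__(self) -> None:
        self._factories: Dict[str, Callable] = {}

    def register(self, name: str) -> Callable:
        """Decorator that records a factory under a given component name."""
        def decorator(factory: Callable) -> Callable:
            self._factories[name] = factory
            return factory
        return decorator

    def build(self, name: str, **kwargs: Any) -> Any:
        """Instantiate a registered component by name."""
        if name not in self._factories:
            raise KeyError(f"unknown component: {name}")
        return self._factories[name](**kwargs)


MODELS = ComponentRegistry()


@MODELS.register("vlm_backbone")
def build_vlm_backbone(hidden_dim: int = 512) -> dict:
    return {"type": "vlm_backbone", "hidden_dim": hidden_dim}


@MODELS.register("action_head")
def build_action_head(action_dim: int = 7) -> dict:
    return {"type": "action_head", "action_dim": action_dim}


# Assemble a VLA "policy" from independently registered parts; replacing a
# component only means registering a different factory under the same name.
policy = {
    "backbone": MODELS.build("vlm_backbone", hidden_dim=256),
    "head": MODELS.build("action_head"),
}
```

Because each factory is self-contained, a new backbone or head can be prototyped and debugged in isolation before being wired into a full training run.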

Files (33.2 MB)

starVLA/starVLA-starVLA-1.2.zip (33.2 MB)
md5:c4a8039bbba001a16ff1d19f281a6128

Additional details

Related works

Is supplement to
Software documentation: https://github.com/starVLA/starVLA/tree/starVLA-1.2 (URL)

Software

Repository URL
https://github.com/starVLA/starVLA
Programming language
Python
