Code repository for the paper "Rethinking Visual-Language-Action Model Scaling: Alignment, Mixture, and Regularization".
You can download the pre-trained checkpoints from Hugging Face:
- EEF Relative + OXE: Download here
- EEF Relative + RealSimEEFJoint: Download here
To run the pre-training stage, use the following scripts:
# Stage 2 Pre-training
bash shell/pretrain-M1-240k-stage2.sh
# Pre-training
bash shell/pretrain-M1-240k.sh
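The launch scripts above can also be started from a small Python wrapper that fails early when a script is missing. This is a minimal sketch, not part of the repository: `build_command` and `run_script` are hypothetical helpers, and because the order in which the two stages should run is not stated here, the caller chooses it.

```python
import subprocess
from typing import List
from pathlib import Path


def build_command(script: str, repo_root: str = ".") -> List[str]:
    """Return the bash command for a launch script, checking that it exists.

    `script` is a path relative to `repo_root`, e.g. "shell/pretrain-M1-240k.sh".
    """
    path = Path(repo_root) / script
    if not path.is_file():
        raise FileNotFoundError(f"launch script not found: {path}")
    return ["bash", script]


def run_script(script: str, repo_root: str = ".") -> int:
    """Run one launch script from the repository root.

    Raises subprocess.CalledProcessError if the script exits non-zero.
    """
    cmd = build_command(script, repo_root)
    completed = subprocess.run(cmd, cwd=repo_root, check=True)
    return completed.returncode


# Example (hypothetical usage, run from the repository root):
# run_script("shell/pretrain-M1-240k.sh")
```

Passing `check=True` stops the wrapper at the first failing stage instead of silently continuing to the next one.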
For LIBERO post-training, first download the required LIBERO datasets, then preprocess the data:
python src/data_postprocessor/libero.py
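Before preprocessing, it can help to confirm that the download actually produced demonstration files. The sketch below assumes the LIBERO demonstrations are stored as `.hdf5` files somewhere under a directory of your choosing; `find_demo_files` is a hypothetical helper, `data/libero` is a placeholder path, and the location that `src/data_postprocessor/libero.py` reads from is not specified here.

```python
from pathlib import Path
from typing import List


def find_demo_files(dataset_dir: str, suffix: str = ".hdf5") -> List[Path]:
    """Recursively list demonstration files under dataset_dir, sorted by path.

    The ".hdf5" suffix is an assumption about the dataset layout.
    """
    root = Path(dataset_dir)
    if not root.is_dir():
        raise NotADirectoryError(f"dataset directory not found: {root}")
    return sorted(p for p in root.rglob(f"*{suffix}") if p.is_file())


# Example (hypothetical path):
# files = find_demo_files("data/libero")
# print(f"found {len(files)} demonstration files")
```

An empty result usually means the download was incomplete or landed in a different directory.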
Run the LIBERO post-training script:
bash shell/relative-post-libero-full-eef_relative-5shot.sh
For RoboCasa post-training, first download the RoboCasa dataset, then preprocess the data:
python src/data_postprocessor/robocasa_human.py
Run the RoboCasa post-training script:
bash shell/relative-post-robocasa-full-eef_relative.sh
To evaluate on LIBERO, ensure the LIBERO environment is installed, then run the evaluation script:
bash shell/eval-libero-relative.sh
To evaluate on RoboCasa, ensure the RoboCasa environment is installed, then run the evaluation script:
bash shell/eval-robocasa-relative.sh
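Both evaluation steps depend on a simulator being installed in the active Python environment, so a quick import check before launching can save a failed run. This is a minimal sketch using only the standard library; `check_packages` is a hypothetical helper, and the import names `libero` and `robocasa` are assumptions about how the two environments are packaged.

```python
import importlib.util
from typing import Dict, Iterable


def check_packages(names: Iterable[str]) -> Dict[str, bool]:
    """Map each top-level package name to whether it can be found for import.

    Only top-level names are supported; nothing is actually imported.
    """
    return {name: importlib.util.find_spec(name) is not None for name in names}


# Example (package names are assumptions):
# status = check_packages(["libero", "robocasa"])
# missing = [name for name, ok in status.items() if not ok]
# if missing:
#     raise SystemExit(f"missing packages: {', '.join(missing)}")
```

`find_spec` only locates the package, so the check is cheap, but it does not verify that the simulator's own dependencies load correctly.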
We thank the authors of the following projects for their contributions to the robotics and machine learning communities: