Kushal Kedia*, Prithwish Dan*, Angela Chao, Maximus A. Pace, Sanjiban Choudhury (*Equal Contribution), Cornell University
All datasets are loaded in the code via the HuggingFace API.
Datasets can be found at: https://huggingface.co/datasets/prithwishdan/RHyME
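The training scripts fetch the data automatically, but if you want to pre-download or inspect the datasets, here is a minimal sketch using the huggingface-cli tool (the local directory below is just a placeholder):

```bash
# Optional: pre-download the RHyME datasets from the HuggingFace Hub.
# The training scripts also fetch them automatically via the HuggingFace API.
huggingface-cli download prithwishdan/RHyME --repo-type dataset --local-dir ./data/RHyME
```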
Follow these steps to install RHyME:
- Create and activate the conda environment, then install the package:
cd rhyme
conda env create -f environment.yml
conda activate rhyme
pip install -e .
- Before running any scripts, set "base_dev_dir" to your working directory for the codebase. You can either write this value directly into the config files under ./config/simulation, or override it on the command line when running a script (see the sketch below).
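For the command-line route, a minimal sketch, assuming the configs accept Hydra-style key=value overrides (the path below is a placeholder for your own checkout):

```bash
# Point base_dev_dir at your local checkout (Hydra-style override; path is a placeholder)
python scripts/train_vision_encoder.py base_dev_dir=/path/to/rhyme
```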
RHyME consists of three steps:
Run the following script to pretrain the visual encoder. By default, we use the Sphere-Easy dataset.
python scripts/train_vision_encoder.py
For different demonstrator types, use these configs:
python scripts/train_vision_encoder.py --config-name=easy_pretrain_hf
python scripts/train_vision_encoder.py --config-name=medium_pretrain_hf
python scripts/train_vision_encoder.py --config-name=hard_pretrain_hf
Compute sequence-level distance metrics and generate pairings from the unpaired datasets using the pre-trained visual encoder:
bash scripts/automatic_pairing.sh --pretrain_model_name <name> --checkpoint <num> --cross_embodiment <type>
Required parameters:
- --pretrain_model_name: Folder name of the vision encoder in ./experiment/pretrain
- --checkpoint: Checkpoint number
- --cross_embodiment: Dataset type (sphere-easy, sphere-medium, sphere-hard)
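For example, a pairing run might look like the following; the encoder folder name and checkpoint number are placeholders for a hypothetical pretraining run, so substitute your own:

```bash
# Hypothetical example: generate pairings for the Sphere-Easy demonstrator
# (folder name and checkpoint number are placeholders from your own pretraining run)
bash scripts/automatic_pairing.sh \
    --pretrain_model_name easy_pretrain_hf \
    --checkpoint 100 \
    --cross_embodiment sphere-easy
```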
Train the conditional diffusion policy to translate imagined demonstrator videos into robot actions:
python scripts/train_diffusion_policy.py \
    --pretrain_model_name <name> \
    --pretrain_ckpt <num> \
    --eval_cfg.demo_type <type>
Required parameters:
- pretrain_model_name: Folder name of the vision encoder in ./experiment/pretrain
- pretrain_ckpt: Checkpoint number
- eval_cfg.demo_type: Dataset type (sphere-easy, sphere-medium, sphere-hard)
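As with the pairing step, a concrete invocation might look like this; the values shown are placeholders, not required names:

```bash
# Hypothetical example: train the policy against the Sphere-Easy demonstrator
# (encoder folder name and checkpoint number are placeholders from your own runs)
python scripts/train_diffusion_policy.py \
    --pretrain_model_name easy_pretrain_hf \
    --pretrain_ckpt 100 \
    --eval_cfg.demo_type sphere-easy
```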
Evaluate policy on unseen demonstrator videos:
python scripts/eval_checkpoint.py
Key parameters:
- pretrain_model_name: Folder name of the vision encoder in ./experiment/pretrain
- pretrain_ckpt: Checkpoint number
- eval_cfg.demo_type: Specifies which demonstrator to evaluate on
- policy_name: Folder name of the diffusion policy in ./experiment/diffusion_bc/kitchen
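A minimal sketch of an evaluation run, assuming the parameters above are passed as Hydra-style key=value overrides (every value below is a placeholder):

```bash
# Hypothetical example: evaluate a trained policy on unseen Sphere-Easy videos
# (assumes Hydra-style overrides; all values are placeholders)
python scripts/eval_checkpoint.py \
    pretrain_model_name=easy_pretrain_hf \
    pretrain_ckpt=100 \
    eval_cfg.demo_type=sphere-easy \
    policy_name=sphere_easy_policy
```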
@article{
kedia2024one,
title={One-shot imitation under mismatched execution},
author={Kedia, Kushal and Dan, Prithwish and Chao, Angela and Pace, Maximus Adrian and Choudhury, Sanjiban},
journal={arXiv preprint arXiv:2409.06615},
year={2024}
}
- Much of the training pipeline is adapted from XSkill.
- The diffusion policy implementation is adapted from Diffusion Policy.
- Many useful utilities are adapted from XIRL.

