Skip to content
/ rlffp Public

This project sets up the SO100 robotic arm in MuJoCo simulation using the so100-mujoco-sim repo and LeRobot interface, enabling easy switching between virtual and real robots.

Notifications You must be signed in to change notification settings

Dimios45/rlffp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

Robot-Learning-From-First-Principles

A hands-on project combining simulation and real-world control of the SO100 robotic arm using vision, language, and reinforcement learning.

File Structure

rlffp/
├── README.md
└── report/
    ├── assets/
    ├── report.md
    └── literature_review.md

Contents

Literature Review

We examine current state of the art models(SOTA's) in robot learning:

  • Action Chunking with Transformers (ACT): An advanced imitation learning framework that uses transformers and conditional variational autoencoders to predict action sequences from human demonstrations.
  • SmolVLA: A compact vision-language-action model that adapts pretrained vision-language models for robotic control with efficient deployment.
  • HIL-SERL: A human-in-the-loop sample-efficient reinforcement learning approach that combines imitation learning with reinforcement learning through human corrective feedback.

Main Report

Our comprehensive report covers:

  • Reinforcement Learning for Robotic Manipulation: Core concepts and popular algorithms like Proximal Policy Optimization (PPO) applied to robotic manipulation tasks.
  • Sim-to-Real Transfer Challenges: An in-depth analysis of the difficulties in transferring policies from simulation to real-world hardware, including camera alignment, visual perception gaps, and hardware calibration issues.
  • Flow Policy Optimization (FPO): An innovative approach that combines flow-based generative models with policy gradient methods, enabling more expressive policy representations.
  • Implementation Details: Our practical implementation of FPO for the PickCube task with the SO100 robotic arm. PPO and FPO Implementation Deployed on SO100 arm
  • Performance Comparison: Evaluation results comparing FPO and PPO approaches in both simulation and real-world settings.

For detailed information, please refer to the full documents:

About

This project sets up the SO100 robotic arm in MuJoCo simulation using the so100-mujoco-sim repo and LeRobot interface, enabling easy switching between virtual and real robots.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published