Code for "Pareto Policy Pool for Model-based Offline RL", presented in ICLR 2022.
python==3.6.13
- d4rl==1.1
- ray==1.0.0
- gym==0.18.3
- torch==1.7.1
- tensorflow==2.3.1
- mujoco-py==2.0.2.13python p3.pyPretrained environment models and behaviour cloning policies can be downloaded via Google Drive.
If you use the code in P3, please kindly cite our paper using following BibTeX entry.
@inproceedings{
yang2022pareto,
title={Pareto Policy Pool for Model-based Offline Reinforcement Learning},
author={Yijun Yang and Jing Jiang and Tianyi Zhou and Jie Ma and Yuhui Shi},
booktitle={International Conference on Learning Representations},
year={2022},
url={https://openreview.net/forum?id=OqcZu8JIIzS}
}
We appreciate the open source of the following projects: