The goal of this project is to extend the theoretical work in Safe Value Functions to practical reinforcement learning (RL) algorithms and to provide empirical evaluations. Specifically, we want to show that we can use RL algorithms to learn safe value functions and, consequently, the viable sets defined in A Learnable Safety Measure and Beyond Basins of Attraction: Quantifying Robustness of Natural Dynamics. We plan to show that, using this framework, we can learn a safety supervisor that knows the set of all safe policies and therefore enables safe learning once an initial safe policy has been learned.
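As a rough illustration of the role of the safety supervisor (a minimal sketch only, assuming a learned state-action safe value function q_safe and a safety threshold of zero; none of these names come from the repository), the supervisor can veto actions that are predicted to leave the viable set:

def safe_actions(q_safe, state, actions, threshold=0.0):
    # Actions whose learned safe value exceeds the threshold are considered viable.
    return [a for a in actions if q_safe(state, a) > threshold]

def supervised_action(q_safe, policy, state, actions, threshold=0.0):
    # The safety supervisor overrides the learner whenever the proposed action
    # is predicted to leave the viable set.
    proposed = policy(state)
    if q_safe(state, proposed) > threshold:
        return proposed
    viable = safe_actions(q_safe, state, actions, threshold)
    # Fall back to the safest available action if the proposal is rejected.
    return max(viable, key=lambda a: q_safe(state, a)) if viable else proposed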
If you want to use a virtual environment
$ conda env create -f DQL-SVF.yml
Activate the environment
$ conda activate DQL-SVF
Install gym-cartpole-swingup
$ pip install gym-cartpole-swingup
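To check that the environment is installed correctly (a minimal smoke test, assuming the default environment id registered by gym-cartpole-swingup under the classic gym API):

import gym
import gym_cartpole_swingup  # noqa: F401  (importing registers the CartPoleSwingUp environments)

env = gym.make("CartPoleSwingUp-v0")
obs = env.reset()
obs, reward, done, info = env.step(env.action_space.sample())
env.close()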
Start learning safe value functions for the hovership dynamics
$ python configs/hovership_config.py
- Module not found: one way to solve this is to add your workspace path to PYTHONPATH in .bashrc
$ gedit ~/.bashrc
$ export PYTHONPATH="${PYTHONPATH}:/home/alextseng/deep-q-learning-on-safe-value-functions"
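Alternatively (a generic Python workaround rather than something the repository provides), the entry script can extend the module search path itself:

import os
import sys

# Assuming the script lives in configs/, one level below the repository root.
repo_root = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
if repo_root not in sys.path:
    sys.path.insert(0, repo_root)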
- On the hovership example, as a proof of concept, we show that once we have learned a sufficiently accurate safety supervisor, learning in the transfer learning stage is safe and more sample efficient than learning from scratch.
- We evaluate our framework on a higher-dimensional task, the inverted pendulum, to shed light on future work.