Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

This is the official code for "Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation" in ICLR2026

This project is developed using CUDA 12.4, PyTorch 1.12.1, python 3.10.

After installing a GPU version of PyTorch, other dependencies can be installed via pip install -r requirements.txt.

Our code is based on https://github.com/huanzhang12/ATLA_robust_RL.

The following is a explanation of this repository. For a more detailed explanation, please refer to the base repository.

Training the Victim Policy

All the following code is assumed to be executed in the src directory.

The scan files in the configs directory allow you to explore various hyperparameters.

cd ../configs
python window-close-v2_vanilla_ppo_scan.py

cd ../src
python run_agents.py ../configs/agent_configs_window-close-v2_vanilla_ppo_scan/ --out-dir-prefix=../configs/agents_window-close-v2_vanilla_ppo_scan > window-close-v2_vanilla_ppo_scan.log

The trained models will be saved under ../configs/agents_window-close-v2_vanilla_ppo_scan

Evaluating the Victim Policy

First, you need to save the model. You can get best_model.YOUR_EXP_ID.model by running the following code:

python get_best_pickle.py ../configs/agents_window-close-v2_vanilla_ppo_scan/000/YOUR_EXP_ID

You can evaluate the performance of the created model using test.py. It is recommended to use the --deterministic option when evaluating.

python test.py --config-path ../configs/agent_configs_window-close-v2_vanilla_ppo_scan/000.json --load-model best_model.YOUR_EXP_ID.model --deterministic

Training the Adversarial Policy

To train an adversarial policy using BIA, you first need to collect trajectories using collect_demo.py. It is recommended to use vanilla PPO trained on the attacker's target task as the target policy.

python collect_demo.py --config-path TARGET_MODEL_CONFIG.json --load-model TARGET_MODEL.model --deterministic

The collected trajectories will be saved in the demo/History directory. Please move the required trajectories to demo/YOUR_DEMO.pkl.

The adversarial policy can be trained in the same way as the victim policy.

cd ../configs
python attack_vanilla_ppo_window-close-v2_ilfd_scan.py

cd ../src
python run_agents.py ../configs/agent_configs_attack_vanilla_ppo_window-close-v2_ilfd_scan/ --out-dir-prefix=../configs/agents_attack_vanilla_ppo_window-close-v2_ilfd_scan > attack_vanilla_ppo_window-close-v2_ilfd_scan.log

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
configs		configs
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

Training the Victim Policy

Evaluating the Victim Policy

Training the Adversarial Policy

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

Training the Victim Policy

Evaluating the Victim Policy

Training the Adversarial Policy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages