In `rollout.py`, `run` and `run_episode` share most of the code. We can merge them and replace `run_episode` with `run(every_episodes=1, is_train=False)`.