The challenge for this coursework was to build an agent to efficiently learn to walk in Gym's BipedalWalker environment, for both the normal and hardcore versions. The terrain in the normal environment is flat, whereas the hardcore environment contains many obstacles that the agent must navigate.
I trained my agents using Loss Adjusted Approximate Actor Prioritized Experience Replay (LA3P) [1] combined with Soft Actor-Critic (SAC), producing agents for both environments that learnt to walk very quickly. Convergence on the normal environment was "outstanding", and my agent achieved "top-of-class" results on the hardcore version. My report details my approach and how it was adapted to each environment.
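The core idea behind LA3P is to split replay sampling between the two networks: the critic is trained on high-TD-error transitions (standard prioritized replay), while the actor is trained on transitions the critic already fits well (inverse priorities), on the intuition that updating the actor on poorly-estimated values is harmful. The sketch below illustrates that sampling scheme only; class and method names are mine, it omits the uniform-batch mixing and importance weights of the full algorithm, and it is not the code from my report.

```python
import numpy as np

# Illustrative sketch of LA3P's dual sampling rule (not the authors' code):
# critic batches favour high TD error, actor batches favour low TD error.
class LA3PBuffer:
    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # priority exponent, as in prioritized replay
        self.eps = eps          # keeps priorities strictly positive
        self.data = []          # stored transitions
        self.priorities = np.zeros(capacity, dtype=np.float64)
        self.pos = 0

    def add(self, transition):
        # New transitions get the current max priority so they are seen soon.
        max_p = self.priorities[:len(self.data)].max() if self.data else 1.0
        if len(self.data) < self.capacity:
            self.data.append(transition)
        else:
            self.data[self.pos] = transition
        self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def _sample(self, batch_size, probs):
        idx = np.random.choice(len(self.data), batch_size, p=probs)
        return idx, [self.data[i] for i in idx]

    def sample_critic(self, batch_size):
        # Prioritized sampling: P(i) proportional to p_i^alpha
        # (high TD error favoured).
        p = self.priorities[:len(self.data)] ** self.alpha
        return self._sample(batch_size, p / p.sum())

    def sample_actor(self, batch_size):
        # Inverse sampling: P(i) proportional to 1 / p_i^alpha
        # (low TD error favoured).
        p = 1.0 / (self.priorities[:len(self.data)] ** self.alpha + self.eps)
        return self._sample(batch_size, p / p.sum())

    def update_priorities(self, idx, td_errors):
        # Priorities track the absolute TD error of the latest critic update.
        self.priorities[idx] = np.abs(td_errors) + self.eps
```

In a SAC training loop this replaces the single uniform batch with two draws per step: `sample_critic` for the Q-function update (whose TD errors feed `update_priorities`) and `sample_actor` for the policy update.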
For this coursework I received 86/100.
[1] Baturay Saglam et al. “Actor prioritized experience replay”. In: Journal of Artificial Intelligence Research 78 (2023), pp. 639–672.

