rl-falling-blocks

Building a few RL implementations on my falling blocks game.

Imitation learning

Imitation learning algorithm based on my own game play of the game (used as a benchmark).

Basic REINFORCE algorithm with average reward baseline for unbiased (high variance) agent.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
data		data
model		model
scripts		scripts
.DS_Store		.DS_Store
README.md		README.md