GitHub - jaybutera/tetrisRL: A Tetris environment to train machine learning agents

Installation

$ git clone https://github.com/jaybutera/tetrisRL
$ cd tetrisRL
$ uv sync

Layout

src/dqn_agent.py - DQN reinforcement learning agent trains on tetris
src/supervised_agent.py - The same convolutional model as DQN trains on a dataset of user playthroughs
src/user_engine.py - Play tetris and accumulate information as a training set
src/run_model.py - Evaluate a saved agent model on a visual game of tetris (i.e.)

$ uv run python src/run_model.py checkpoint.pth.tar

Usage

Using the Environment

The interface is similar to an OpenAI Gym environment.

Initialize the Tetris RL environment

from src.engine import TetrisEngine

width, height = 10, 20
env = TetrisEngine(width, height)

Simulation loop

# Reset the environment
obs = env.clear()

while True:
    # Get an action from a theoretical AI agent
    action = agent(obs)

    # Sim step takes action and returns results
    obs, reward, done = env.step(action)

    # Done when game is lost
    if done:
        break

Example Usages

Play Tetris for Training Data

Play games and accumulate a data set for a supervised learning algorithm to trian on. An element of data stores a (state, reward, done, action) tuple for each frame of the game.

You may notice the rules are slightly different than normal Tetris. Specifically, each action you take will result in a corresponding soft drop This is how the AI will play and therefore how the training data must be taken.

To play Tetris:

$ uv run python src/user_engine.py

Controls:
W: Hard drop (piece falls to the bottom)
A: Shift left
S: Soft drop (piece falls one tile)
D: Shift right
Q: Rotate left
E: Rotate right

At the end of each game, choose whether you want to store the information of that game in the data set. Data accumulates in a local file called 'training_data.npy'.

Example supervised learning agent from data

Run the supervised agent file and specify the standard training data file generated in the previous step as a command line argument.

$ uv run python src/supervised_agent.py training_data.npy

Example reinforcement learning agent

# Start from a new randomized dqn agent
$ uv run python src/dqn_agent.py
# Start from a the last recorded dqn checkpoint
$ uv run python src/dqn_agent.py resume
# Specify a custom checkpoint
$ uv run python src/dqn_agent.py resume supervised_checkpoint.pth.tar

The DQN agent currently optimizes on a metric of freedom of action. In essence the agent should learn to maximize the entropy of the board. A player in Tetris has the most freedom of action when the area is clear of pieces.

Watch a checkpoint play a game

$ uv run python src/run_model.py checkpoint.pth.tar

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
src		src
LICENSE		LICENSE
README.md		README.md
plot_training.py		plot_training.py
pyproject.toml		pyproject.toml
tetrisRL_logo.png		tetrisRL_logo.png
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Layout

Usage

Using the Environment

Example Usages

Play Tetris for Training Data

Example supervised learning agent from data

Example reinforcement learning agent

Watch a checkpoint play a game

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

License

jaybutera/tetrisRL

Folders and files

Latest commit

History

Repository files navigation

Installation

Layout

Usage

Using the Environment

Example Usages

Play Tetris for Training Data

Example supervised learning agent from data

Example reinforcement learning agent

Watch a checkpoint play a game

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages