Reinforcement-Learning-agent

The purpose of this project is to compare the performances of three different learning agents - Reinforcement Learning (Monte-Carlo policy based gradient), Actor-critic Agent, and Approximate Q-Learning.

Running Pacman

In order to visualize the pacman running under one of the agents, simply run python pacman.py with the option to choose layout, agent type, number of runs, training iterations, etc.

Option	Description
-h, --help	Show help message and exit
-n GAMES, --numGames=GAMES	The number of GAMES to play [Default: 1]
-l LAYOUT_FILE, --layout=LAYOUT_FILE	the LAYOUT_FILE from which to load the map layout [Default: mediumClassic]
-p TYPE, --pacman=TYPE	the agent TYPE in the pacmanAgents module to use [Default: KeyboardAgent]
-t, --textGraphics	Display output as text only
-q, --quietTextGraphics	Generate minimal output and no graphics
-g TYPE, --ghosts=TYPE	the ghost agent TYPE in the ghostAgents module to use [Default: RandomGhost]
-k NUMGHOSTS, --numghosts=NUMGHOSTS	The maximum number of ghosts to use [Default: 4]
-z ZOOM, --zoom=ZOOM	Zoom the size of the graphics window [Default: 1.0]
-f, --fixRandomSeed	Fixes the random seed to always play the same game
-r, --recordActions	Writes game histories to a file (named by the time they were played)
--replay=GAMETOREPLAY	A recorded game file (pickle) to replay
-a AGENTARGS, --agentArgs=AGENTARGS	Comma separated values sent to agent. e.g. "opt1=val1,opt2,opt3=val3"
-x NUMTRAINING, --numTraining=NUMTRAINING	How many episodes are training (suppresses output) [Default: 0]
--frameTime=FRAMETIME	Time to delay between frames; <0 means keyboard [Default: 0.1]
-c, --catchExceptions	Turns on exception handling and timeouts during games
--timeout=TIMEOUT	Maximum length of time an agent can spend computing in a single game [Default: 30]

Generating Layouts

The layouts may be randomly-generated using test.sh script, with the arguments layout, run_number, and training_episodes. The results will be stored in the results folder.

Running Student's T-Test

The Student's T-Test may be run between each pair of agents, for each generated layout in the results folder.

The result of the T-Test will be stored in t_test_results.json.

Running Normality Test

TODO: Insert instructions here

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
layouts		layouts
results		results
.gitignore		.gitignore
README.md		README.md
VERSION		VERSION
analysis.py		analysis.py
autograder.py		autograder.py
crawler.py		crawler.py
do_t_test.py		do_t_test.py
environment.py		environment.py
featureExtractors.py		featureExtractors.py
game.py		game.py
ghostAgents.py		ghostAgents.py
grading.py		grading.py
graphicsCrawlerDisplay.py		graphicsCrawlerDisplay.py
graphicsDisplay.py		graphicsDisplay.py
graphicsGridworldDisplay.py		graphicsGridworldDisplay.py
graphicsUtils.py		graphicsUtils.py
gridworld.py		gridworld.py
keyboardAgents.py		keyboardAgents.py
layout.py		layout.py
layoutGenerator.py		layoutGenerator.py
learningAgents.py		learningAgents.py
mdp.py		mdp.py
pacman.py		pacman.py
pacmanAgents.py		pacmanAgents.py
plotGraph.py		plotGraph.py
projectParams.py		projectParams.py
qlearningAgents.py		qlearningAgents.py
reinforcementTestClasses.py		reinforcementTestClasses.py
reinforcementlearningAgents.py		reinforcementlearningAgents.py
submission_autograder.py		submission_autograder.py
t_test_results.json		t_test_results.json
test.sh		test.sh
testClasses.py		testClasses.py
testParser.py		testParser.py
textDisplay.py		textDisplay.py
textGridworldDisplay.py		textGridworldDisplay.py
util.py		util.py
valueIterationAgents.py		valueIterationAgents.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement-Learning-agent

Running Pacman

Generating Layouts

Running Student's T-Test

Running Normality Test

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reinforcement-Learning-agent

Running Pacman

Generating Layouts

Running Student's T-Test

Running Normality Test

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages