This is a very basic AI-based tic-tac-toe game.Good accuracy is achieved by allowing the agents to play against each other for about 10,000 times.The agent learns to interact with the environment by remaining in a state and reward is the incentive which drives the agent towards its goal of winning.The state in which the agent wins is assigned reward 1 and rewards of all other states leading to final state is calculated iteratively.
-
Notifications
You must be signed in to change notification settings - Fork 0
shub124/Reinforcement_learning
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published