You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: Documents/IntelligentPoolDetails.md
+15-6Lines changed: 15 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,15 +18,19 @@ During the development of the materials for the Computational Intelligence in Ga
18
18
alt="SimpleCase"
19
19
width="600" border="10" />
20
20
</p>
21
-
21
+
<palign="center">
22
+
<em>Simple case scene</em>
23
+
</p>
22
24
In the end, at least until now, PPO is not working at all. We ended up make a even simpler case than the simple case in the beginning, with only one red ball and 4 pockets, and the game restarts after every shot.
23
25
24
26
<palign="center"id="simpler-case-image">
25
27
<img src="Images/IntelligentPool/SimplerCase.png"
26
28
alt="SimplerCase"
27
29
width="600" border="10" />
28
30
</p>
29
-
31
+
<palign="center">
32
+
<em>Even simpler case scene</em>
33
+
</p>
30
34
Here I will go through the development process, describe each example scenes, tell how to play with them, and explain why I think the billiard game does not work directly with pure PPO or supervised learning.
31
35
32
36
## What we have tried
@@ -120,7 +124,10 @@ This case, the same method is used as in case 3, but with much simpler scenario.
According to the heatmap, now the "better solutions" are not that scatterd as in case 3 anymore. That means it is easier for MAES to find the optimal solutions and the average solutions learned by neural network make more sense.
@@ -135,15 +142,17 @@ After collecting 20000 samples and training the neural network as in case 2 for
135
142
</p>
136
143
137
144
<palign="center">
138
-
<em>With neural network only</em>
145
+
<em>Simpler case played by neural network only</em>
139
146
</p>
140
147
141
-
If we use the output from neural network as the initial guess of the optimizer, the iteration count is about reduced from 10 to 5 in our case. Nice!
148
+
If we use the output from neural network as the initial guess of the optimizer, the average iteration count to find a satisfied solution is about reduced from 10 to 5 in our case. Nice!
<em>MAES iteration comparison when changingfrom MAES only to MAES with neural network- blue line: average iteration count. purple line: average score. </em>
147
156
</p>
148
157
149
158
### Case 6 - 1 red ball, 4 pockets, one shot, MAES and Supervised Learning using GAN
0 commit comments