forked from kobanium/Ray
-
Notifications
You must be signed in to change notification settings - Fork 56
Parameters
Kensuke Matsuzaki edited this page Aug 7, 2017
·
1 revision
- range: (0.0 <= x <= 1.0)
- default: 0.20
- Resign when winning rate is lower than this value.
- range: (1 <= x)
- default: 15
- Expand search tree node how many times visited.
- range: (0.0 < x)
- default: 2
- a constant determining the level of exploration
- range: (0.0 <= x <= 1.0)
- default: 0.8
- Mix the value network and rollout evaluations
- range: (0.0 <= x)
- default: 0.49
- Softmax temperature of policy network
- range: (0.0 <= x)
- default: 0.056
- range: (0 <= x <= 100)
- default: 0
- Percentage of playing cached moves is tree search in simulation.
- range: (0 <= x <= 100)
- default: 0
- Percentage of playing Last-Good-Reply-Forgetting moves in simulation.

