Skip to content

Commit 00878b4

Browse files
authored
Update Training-PPO.md
1 parent 87a1b52 commit 00878b4

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

Documents/Training-PPO.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -63,9 +63,9 @@ This is a simple implementation of RLNetworkAC that you can create a plug it in
6363
## Training with Heuristics
6464
If you already know some policy that is better than random policy, you might give it as a hint to PPO to increase the training a bit.
6565

66-
1. Implement the [AgentDependentDeicision- needs link](dfsdf) for your policy and attach it to the agents that you want them to occasionally use this policy.
66+
1. Implement the [AgentDependentDeicision](AgentDependentDeicision.md) for your policy and attach it to the agents that you want them to occasionally use this policy.
6767
2. In your trainer parameters, set `useHeuristicChance` to larger than 0.
68-
3. Use [TrainerParamOverride - needs link](asdfs) to decrease the `useHeuristicChance` over time during the training.
68+
3. Use [TrainerParamOverride](TrainerParamOverride.md) to decrease the `useHeuristicChance` over time during the training.
6969

7070

7171

0 commit comments

Comments
 (0)