You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
/// <param name="outAction">Output action. If action space is continuous, it is the mean; if aciton space is discrete, it is the probability of each action</param>
26
-
/// <param name="outValue"></param>
26
+
/// <param name="outValue">outout value.</param>
27
27
/// <param name="outVariance">output variance. Only needed if the action space is continuous. It can either have batch dimension or not for RLModelPPO</param>
/// Same as AddExperience(), called every loop when training. You are supposed to process the collected data for episodes or something. You can do it in AddExperience as well...This method is called right after AddExperience().
81
+
/// </summary>
82
+
/// <param name="currentInfo">infomation of the agents before the action taken.</param>
83
+
/// <param name="newInfo">infomation of the agents after tha ction taken</param>
/// When this returns true, UpdateModel() will be called();
88
+
/// </summary>
89
+
/// <returns>Whether it is ready to udpate the model.</returns>
42
90
boolIsReadyUpdate();
91
+
92
+
/// <summary>
93
+
/// Put all of your logic for training the model. This is called when IsReadyUpdate() returns true.
94
+
/// </summary>
43
95
voidUpdateModel();
44
96
97
+
/// <summary>
98
+
/// Return whether training is enabled. AddExperience(), ProcessExperience() and UpdateModel() will not be called if it returns false.
99
+
/// </summary>
100
+
/// <returns></returns>
45
101
boolIsTraining();
46
102
}
47
103
104
+
105
+
/// <summary>
106
+
/// A abstract class for trainer if you want to save some time impelmenting ITrainer...It provides some helper functions and stuff..., you can use this as based class instead of ITrainer.
0 commit comments