As in the title. I found that solve_nn trains a small network that maps states to actions. What is it used for? And is it possible to add force information to the environment state? Thanks.