Hello, author.
I'm facing a problem now. The model I trained is prone to collisions, so I increased the collision penalty. Now the vehicle has learned a strategy to run off the road to avoid the penalty, as shown in the picture. Now the vehicle is outside and there is no collision, so there is no penalty. How can I make the vehicle continue to face penalties after running off the road
