Improve RL training setup with multi-agent warm starts and richer collision metrics by FatemehAB · Pull Request #19 · inverted-ai/torchdriveenv

FatemehAB · 2025-10-23T17:26:49Z

Training script can now load pretrained SB3 models, use stronger multi-agent defaults, and write cleaner WandB metadata.
Gym environment returns dict observations, keeps the simulator on the chosen device, records videos when asked, and surfaces the new collision metric.
Simulator setup now double-checks scenario data, creates waypoint goals, and keeps background traffic controllers working together as expected.
Single-agent wrapper shares the new observations and still returns rewards and done flags as simple numbers.

…lision metrics

Improve RL training setup with multi-agent warm starts and richer col…

d6b40ff

…lision metrics

Provide feedback