Skip to content

Improve RL training setup with multi-agent warm starts and richer collision metrics#19

Open
FatemehAB wants to merge 1 commit intoinverted-ai:masterfrom
FatemehAB:updated-api-and-collision-metrics
Open

Improve RL training setup with multi-agent warm starts and richer collision metrics#19
FatemehAB wants to merge 1 commit intoinverted-ai:masterfrom
FatemehAB:updated-api-and-collision-metrics

Conversation

@FatemehAB
Copy link

  • Training script can now load pretrained SB3 models, use stronger multi-agent defaults, and write cleaner WandB metadata.
  • Gym environment returns dict observations, keeps the simulator on the chosen device, records videos when asked, and surfaces the new collision metric.
  • Simulator setup now double-checks scenario data, creates waypoint goals, and keeps background traffic controllers working together as expected.
  • Single-agent wrapper shares the new observations and still returns rewards and done flags as simple numbers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant