This project implements a Fleet Management System (FMS) in a 2D grid-based warehouse environment.
Multiple robots navigate, pick up boxes, deliver them to drop zones, avoid collisions, and manage battery levels.
A Fleet Management System is responsible for:
- Assigning tasks to robots
- Planning paths for navigation
- Avoiding collisions
- Managing resources like battery
In real-world warehouses (like Amazon), FMS ensures efficient and safe robot coordination.
In this project, robots operate using:
- Grid-based movement (Up, Down, Left, Right)
- Local navigation using 5x5 observation window
- Distance-based movement towards target
- If robot is not carrying → nearest box assigned
- If robot is carrying → nearest drop zone assigned
- Swap detection (robots swapping positions blocked)
- Destination conflict handling (multiple robots same cell blocked)
- Battery decreases every step
- Charging stations restore battery
- No intelligent scheduling (rule-based)
👉 This is mostly rule-based + heuristic system
If Reinforcement Learning is applied, system becomes much smarter:
- Learn optimal task allocation
- Minimize total travel time
- Reduce collisions automatically
- Learn when to charge vs when to deliver
- Handle multi-robot coordination dynamically
- Decide best time to go to charger
- Avoid unnecessary charging
- Balance delivery vs charging
- Fewer collisions
- Faster deliveries
- Better resource utilization
- 0: Free Space
- 1: Obstacle
- 2: Charging Station
- 3: Box (Pickup)
- 4: Drop Location
- 5: Robot
- Move in 4 directions + stay
- Pick and drop boxes
- Battery consumption per step
- Local perception (5x5 grid like LiDAR)
- Collision avoidance
- Multiple robots operate simultaneously
- Collision handling:
- Swap conflict prevention
- Destination conflict resolution
- Nearest box assigned if not carrying
- Nearest drop assigned if carrying
- Battery decreases each step
- Charging station restores battery
- Low battery threshold triggers charging (basic logic)
- +50 → Delivery completed
- +5 → Box picked
- +200 → All deliveries completed
- +1 → Charging when battery is low
- +distance improvement → moving closer to target
- -0.5 → Collision / invalid move
- -50 → Battery depletion
Each robot gets a 31-dimensional vector:
- 5x5 local grid (25 values)
- Battery level
- Carrying status
- Relative target position
- Relative charger position
- 0: Up
- 1: Down
- 2: Left
- 3: Right
- 4: Stay
- 1 box
- Low battery drain
- Simple navigation
- 4 boxes
- Moderate battery drain
- Requires coordination
- 8 boxes
- High battery drain
- Complex coordination + charging
- All boxes delivered
- Battery reaches 0
- Max steps reached
- Designed a custom multi-robot simulation environment
- Implemented collision-safe movement logic
- Built task allocation and navigation system
- Created RL-compatible observation + reward system
This project demonstrates how a Fleet Management System can be built using rule-based planning.
While current implementation uses heuristics (path planning + allocation),
it can be extended with Reinforcement Learning to achieve:
- Smarter decision making
- Better coordination
- Efficient energy usage
- Reduced collisions
👉 This makes it a strong foundation for real-world warehouse robotics systems.