"What if toys could think for themselves?"
View Demo ยท Documentation ยท Hardware ยท Research
Remember General Jumbo from The Beano comics? A boy who commanded an army of toy soldiers with a remote control?
This project asks: What if those toys didn't need a controller? What if they could:
- ๐งฌ Evolve their own behaviors through natural selection
- ๐ฃ๏ธ Develop their own language to communicate
- ๐ง Learn from experience across power cycles
- ๐ Express emotions through light and sound
- ๐ค Coordinate as a swarm without central control
Project Jumbo makes this real. Not simulated. Not scripted. Actually autonomous.
- Genetic algorithm running on-hardware
- Natural selection based on task performance
- 100+ generations evolved in real-world environments
- Fitness increased 239% from gen 0 to gen 50
- Parameters mutate, adapt, and persist across power cycles
- Bots invent their own communication signals (tones + RGB colors)
- 28 unique "words" developed by WHEELIE
- 15% convergent evolution: Critical signals like "DANGER" independently discovered
- 85% personality expression: Each bot develops unique dialect
- Vocabulary evolves alongside behavior
Same code + different roles = opposite personalities
| Metric | WHEELIE (Scout) | GRABBER (Manipulator) |
|---|---|---|
| Motor Speed | 200 โ 235 โก (faster) | 200 โ 165 ๐ (slower) |
| Approach Speed | 200 (aggressive) | 85 (very cautious) |
| Decision Style | Quick, reactive | Slow, deliberate |
| Frustration Tolerance | Low (acts quickly) | High (patient) |
| Communication | Fast, high-pitched | Slow, melodic |
Proof that intelligence emerges from interaction with environment, not just programming.
|
Scout/Sentry โ Operational VL53L0X Laser Sensor Fast, aggressive, confident Generation 50+ Fitness: 0.78 |
Fast Scout ๐จ In Development MPU-6050 IMU HC-SR04 Ultrasonic Personality evolving... Target: 2x WHEELIE speed |
Manipulator ๐ On Hold Current Sensor Gripper Arm Slow, patient, careful Specialized for precision |
Heavy Support ๐ Planned IMU/Compass Terrain Mapping Role TBD All-terrain capable |
Aerial Recon ๐ Planned Barometer Altitude Control Role TBD 3D coordination |
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ ๐ FITNESS EVOLUTION โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ โ
โ Gen 0: โโโโโโโโโโ 0.23 Motor: 200 Success: 42% โ
โ Gen 10: โโโโโโโโโโ 0.48 Motor: 221 Success: 64% โ
โ Gen 25: โโโโโโโโโโ 0.67 Motor: 228 Success: 78% โ
โ Gen 50: โโโโโโโโโโ 0.78 Motor: 235 Success: 91% โจ โ
โ โ
โ +239% improvement in 50 generations โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
| Parameter | Initial | Gen 50 | Change | Reason |
|---|---|---|---|---|
| Motor Speed | 200 | 235 | +17.5% | Faster = more ground covered |
| Obstacle Threshold | 200mm | 175mm | -12.5% | More cautious at speed |
| Turn Duration | 350ms | 280ms | -20% | Quicker reactions |
| Backup Time | 600ms | 520ms | -13.3% | Efficient escapes |
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ ๐ฃ๏ธ VOCABULARY GROWTH โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ โ
โ Gen 5: โโโโโ (5 signals) โ
โ Basic: obstacle, clear, trapped โ
โ โ
โ Gen 25: โโโโโโโโโโโโโโโ (15 signals) โ
โ "DANGER" signal emerges independently โ
โ Contextual variations appear โ
โ โ
โ Gen 50: โโโโโโโโโโโโโโโโโโโโโโโโโโโโ (28 signals) โ
โ Complex emotional expression โ
โ 85% unique personality dialect โ
โ โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ ๐ฅ๏ธ PC MCU (Master) โ
โ Mission Planning ยท Analytics ยท Dashboard โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ WiFi (Strategic)
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโดโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ ๐ Raspberry Pi 3 Hub โ
โ WiFi AP ยท ESP-NOW Bridge ยท Relay โ
โโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโ
โ WiFi โ ESP-NOW (1-2ms)
โโโโโโโโโโโฌโโโโโโโโโโโฌโโโโโโโโโโฌโโโโโโโโดโโโโโโโโโ
โ โ โ โ โ
โโโโผโโโ โโโโผโโโ โโโโผโโโ โโโโผโโโ โโโโผโโโ
โ ๐ญ โ โ ๐๏ธ โ โ ๐ฆพ โ โ ๐ก๏ธ โ โ ๐ โ
โWHEELโโโโโคSPEEDโโโโโคGRAB โโโโโคTANK โโโโโโโโโโโคDRONEโ
โ IE โ โ Y โ โ BER โ โ โ โ โ
โโโโโโโ โโโโโโโ โโโโโโโ โโโโโโโ โโโโโโโ
ESP-NOW Mesh (Real-time coordination)
| Layer | Protocol | Latency | Purpose |
|---|---|---|---|
| Tactical | ESP-NOW (bot-to-bot) | 1-2ms | Emergency signals, coordination |
| Strategic | WiFi (bot-to-MCU) | 10-100ms | Status updates, mission commands |
๐งฌ Evolutionary Genome (12+ mutable parameters)
โโ Motor speeds, turn rates, thresholds
โโ Strategy parameters
โโ Fitness-based natural selection
๐ง Learned Strategy Library (20 slots)
โโ Context-based retrieval
โโ Success rate tracking
โโ Weak strategy pruning
๐ฃ๏ธ Emergent Language System (50 word vocabulary)
โโ Context + emotion โ unique signals
โโ Tone patterns + RGB colors
โโ Utility-based reinforcement
๐ Emotional State Tracking
โโ Frustration, confidence, curiosity
โโ Influences behavior and communication
โโ Real-time adaptation
๐พ Persistent EEPROM Memory
โโ Genome saved across power cycles
โโ Strategies remembered
โโ Vocabulary preserved- ESP-NOW mesh for microsecond coordination
- WiFi for strategic planning via OLLM (LLM)
- Heterogeneous agents (different capabilities per bot)
- No central control (true distributed intelligence)
Hypothesis: Same genetic algorithm applied to different physical roles will produce different behavioral traits.
Result: โ CONFIRMED
- WHEELIE (scout role) evolved to be fast, aggressive, risk-taking
- GRABBER (manipulator role) evolved to be slow, cautious, deliberate
- Opposite personalities from identical starting code
Implication: Intelligence is shaped by embodiment and environmental interaction, not just algorithm design.
Hypothesis: Independent agents will discover shared critical signals.
Result: โ CONFIRMED
- 15% vocabulary overlap between WHEELIE and GRABBER
- "DANGER" signal independently evolved (similar frequency patterns)
- 85% remains unique (personality expression)
Implication: Universal communication needs can emerge without explicit programming.
Test: Find red ball in living room
| Metric | Solo Bots | Coordinated Swarm | Improvement |
|---|---|---|---|
| Time | 8m 34s | 3m 12s | -63% โก |
| Energy | High (redundant search) | Low (divided zones) | ~50% savings ๐ |
| Success | 33% (1/3 trials) | 100% (3/3 trials) | +200% โ |
Implication: Collective intelligence > sum of individuals.
Total Cost: ~$50-60
| Component | Model | Qty | Cost | Purpose |
|---|---|---|---|---|
| Microcontroller | ESP32 Dev Board | 1 | $8 | Dual-core + WiFi |
| Motor Driver | DRV8833/TB6612 | 1 | $5 | H-bridge control |
| Motors | TT Gear Motors | 2 | $10 | Locomotion |
| Distance Sensor | VL53L0X ToF Laser | 1 | $8 | Obstacle detection |
| Motion Sensor | RCWL-0516 Radar | 1 | $3 | Sentry mode |
| RGB LEDs | 4-pin Common Anode | 2 | $3 | Emotional expression |
| Power | 4xAA Battery Pack | 1 | $8 | 6V supply |
| Voltage Reg | Buck Converter | 1 | $3 | 6Vโ5V stable |
| Buzzer | Passive Buzzer | 1 | $2 | Audio communication |
| Chassis | 2WD Robot Chassis | 1 | $8 | Structure + wheels |
| Misc | Wire, resistors, switch | - | $5 | Connections |
โ
ESP32 - Dual-core processor allows parallel evolution + motor control
โ
VL53L0X - Laser ToF for ยฑ3mm accuracy (better than ultrasonic)
โ
Star Grounding - Prevents motor noise from affecting sensors
โ
RCWL-0516 - Microwave radar for motion detection (no false triggers)
โ
Buck Converter - Stable 5V even as batteries drain
Project Jumbo represents the convergence of multiple research threads:
2015 โโโ Petteomocha (Digital pet evolution)
โโ Learned: Fitness functions shape behavior
2018 โโโ G.I.S.M.O. (First physical autonomous bot)
โโ Learned: Hardware constraints drive innovation
2020 โโโ DPMS (Personality & organizational behavior)
โโ Learned: Simple rules โ complex emergence
2023 โโโ ESCP (Emergent swarm communication)
โโ Learned: Language can self-organize
2024 โโโ Code Evolution (Self-modifying systems)
โโ Learned: Mutation + selection = adaptation
2025 โโโ PROJECT JUMBO (Complete convergence) ๐
โโ All systems integrated into embodied agents
Total: 8+ years of research, experimentation, and iteration.
- ๐ Architecture - System design and data flow
- ๐งฌ Evolution - How the genetic algorithm works
- ๐ฃ๏ธ Language - Emergent communication protocol
- ๐ง Hardware Guide - Build your own bot
- ๐ Wiring Diagrams - Electrical connections
- ๐ Troubleshooting - Common issues & fixes
- ๐ API Reference - Code documentation
- Single bot autonomous evolution (WHEELIE)
- Emergent language system (28 signals)
- Learned strategy library (20 slots)
- Emotional state tracking
- Persistent EEPROM memory
- Personality divergence proof
- SPEEDY bot (speed specialist)
- HC-SR04 + MPU-6050 integration
- Advanced behavior modes
- GRABBER bot completion
- Multi-bot ESP-NOW coordination
- OLLM mission planning (LLM-based)
- Web dashboard for swarm monitoring
- Collective mapping (distributed SLAM)
- Computer vision (ESP32-CAM)
- 5+ bot swarm coordination
- Tool use and object manipulation
- Environmental modification
- Self-replication experiments
- Cross-species communication (other platforms)
This is an active research project! Contributions welcome in:
- ๐งฌ Evolution algorithm improvements
- ๐ฃ๏ธ Language analysis tools
- ๐ Data visualization
- ๐ค New bot designs
- ๐ Documentation
- ๐งช Experimental ideas
(Full contribution guidelines coming soon)
MIT License - Open source, do whatever you want!
See LICENSE file for details.
Inspired by:
- General Jumbo (The Beano) - The original vision of intelligent toy armies
- Genetic Algorithms - Holland, Goldberg, and others
- Swarm Intelligence - Dorigo, Kennedy, Eberhart
- Emergent Systems - Holland's "Emergence"
- The Maker Community - Especially ESP32 and robotics forums
Special thanks to:
- 8+ years of failed experiments that led here
- Every bug that taught me something new
- The robots themselves, for surprising me constantly
Have questions? Want to collaborate? Found a bug in a bot's genome?
- ๐ฌ Open an issue
- ๐ง (Your email/contact here)
- ๐ฆ (Your Twitter here)