First and foremost, thank you a lot for your work. I was able to download it and plug it into my system with minimal efforts. The code was quite easy to iterate on, and I was quickly able to add some simple features.
I spent the last 3 days working specifically on optimizing the agentic orchestrator, and found some - i believe - very meaningful improvements
I would love to try to integrate them into the product, if you are okay with it. But I did a lot of stuff, including some refacto, some minor changes in the way session works, etc. so I need to take the time to do some more atomic pull requests.
The most important optimisations are coming from:
- Agressively parallelizing tool calls, and defining a "most optimal" 4-steps development framework (find the file to read, read them, plan, write + execute).
- Have a description for each file, a bit like the description available for each skills (hence the refacto i did to split your code into ~200 lines files), enabling better search and diminishing the number of reading turns.
- Design a context garbage collector that the model can use to compress/drop information.
On some internal feature development + bugfixes tasks, i was able to cut input token use (majority of the cost) by a factor of x10, and get a x3-5 speedup compared to claude code vanilla with API billing activated.
Here is my fork for reference : https://github.com/Simon-Free/bouzecode. The README explains in more details my reasoning.
First and foremost, thank you a lot for your work. I was able to download it and plug it into my system with minimal efforts. The code was quite easy to iterate on, and I was quickly able to add some simple features.
I spent the last 3 days working specifically on optimizing the agentic orchestrator, and found some - i believe - very meaningful improvements
I would love to try to integrate them into the product, if you are okay with it. But I did a lot of stuff, including some refacto, some minor changes in the way session works, etc. so I need to take the time to do some more atomic pull requests.
The most important optimisations are coming from:
On some internal feature development + bugfixes tasks, i was able to cut input token use (majority of the cost) by a factor of x10, and get a x3-5 speedup compared to claude code vanilla with API billing activated.
Here is my fork for reference : https://github.com/Simon-Free/bouzecode. The README explains in more details my reasoning.