Thank you for your great work and for sharing the project!
I noticed that reproducing your training requires train_stage1_data.json and train_stage2_data.json, but these files don’t seem to be available at the moment. I’m wondering if you have any plans to open-source them in the future to help others reproduce your results?