Hi,
Thank you for your excellent work on this project. It's truly impressive. I am particularly interested in the tokenizer training part of your work, and I noticed that you plan to release the tokenizer training code. Could you share an estimated timeline for its release? No pressure at all, of course; I am just very keen to learn from your approach.
Thank you again for your contribution to the community!
Best regards