Hi!
I see that you are still pretty actively updating this codebase. I've been working on expanding it to add other model architectures (Mamba (which I already got working), RWKV, Hyena). Was wondering if you would be interested in a pull request, adding those models, or if you think it would pollute and confuse people as your article only mentions Llama.
Nice work,
Gonçalo