thanhtcptit

Hi, thank you for your work. I noticed an error in the RoPE inner product equation. Additionally, this implementation pairs features differently from the original paper when rotating the feature subspaces, which I believe is worth noting to avoid confusion.
Ref: https://github.com/pytorch/torchtune/blob/main/torchtune/modules/position_embeddings.py#L117
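To make the pairing point concrete, here is a minimal sketch of the two pairing conventions commonly seen in RoPE code: rotating adjacent features `(x[0], x[1]), (x[2], x[3]), ...` and rotating features half a dimension apart `(x[i], x[i + d/2])`. The function names, the toy vectors, and `base=10000.0` are illustrative assumptions, not code from this repository or from torchtune; which convention a given codebase uses should be checked against its source. Both variants keep the inner product dependent only on the relative offset, but they produce different rotated vectors, so they cannot be mixed.

```python
import math


def rope_angles(pos, dim, base=10000.0):
    # One rotation angle per 2-D subspace: pos * base^(-2i/dim).
    return [pos * base ** (-2.0 * i / dim) for i in range(dim // 2)]


def rope_interleaved(x, pos, base=10000.0):
    """Rotate adjacent pairs (x[0], x[1]), (x[2], x[3]), ..."""
    out = list(x)
    for i, angle in enumerate(rope_angles(pos, len(x), base)):
        c, s = math.cos(angle), math.sin(angle)
        x0, x1 = x[2 * i], x[2 * i + 1]
        out[2 * i] = x0 * c - x1 * s
        out[2 * i + 1] = x0 * s + x1 * c
    return out


def rope_half_split(x, pos, base=10000.0):
    """Rotate pairs (x[i], x[i + d/2]) taken from the two halves."""
    half = len(x) // 2
    out = list(x)
    for i, angle in enumerate(rope_angles(pos, len(x), base)):
        c, s = math.cos(angle), math.sin(angle)
        x0, x1 = x[i], x[i + half]
        out[i] = x0 * c - x1 * s
        out[i + half] = x0 * s + x1 * c
    return out


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


# Toy query/key vectors (hypothetical values, dim = 4).
q = [0.5, -1.0, 2.0, 0.25]
k = [1.5, 0.75, -0.5, 1.0]

for rope in (rope_interleaved, rope_half_split):
    # Same relative offset (n - m = 3) at two different absolute positions.
    near = dot(rope(q, 2), rope(k, 5))
    far = dot(rope(q, 10), rope(k, 13))
    assert math.isclose(near, far, abs_tol=1e-9)

# The two conventions disagree on the rotated vector itself.
assert rope_interleaved(q, 2) != rope_half_split(q, 2)
```

The two layouts are related by a fixed permutation of the feature axis, so weights trained under one convention need that permutation applied before they are used with the other.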

Cheers,
