There is some weird implements in your code. I can't reopen the issue so I just open a new one. about forgotten negtive dn samples: https://github.com/IDEA-Research/DINO/issues/119 about forgotten extra decoder norm https://github.com/IDEA-Research/DINO/issues/147