Hi @frank-xwang,
I noticed the implementation of the DropLoss of Mask2Former differs from the implementation used in Cascade Mask R-CNN. However, the VideoCutLER paper does not share many details of this implementation. Could you please explain the DropLoss implementation for Mask2Former?
Thanks for the help!
Christoph