Question: DropLoss Implementation of Mask2Former

Hi @frank-xwang,

I noticed the implementation of the DropLoss of Mask2Former differs from the implementation used in Cascade Mask R-CNN. However, the VideoCutLER paper does not share many details of this implementation. Could you please explain the DropLoss implementation for Mask2Former?

Thanks for the help!
Christoph