Dear authors:

In CLIP2Scene, is the original CLIP data made up of image-text pairs?

Best