When I tried to run project_feature.py, I encountered problem at :
|
image_embedding = dinov2_vitb14(color).reshape((68, 91, 768)).permute(2, 0, 1).unsqueeze(0) |
I print the shape of dinov2_vitb14(color), it seem to be [1, 768], which can't be reshape to [68, 91, 768]:
$ python pseudo_mask_gen/project_feature.py
projecting multiview features to point cloud...
processing scene0000_00...
torch.Size([1, 768])
Traceback (most recent call last):
File "/home/lyd/Part2Object/pseudo_mask_gen/project_feature.py", line 185, in <module>
image_embedding = image_embedding.reshape((68, 91, 768)).permute(2, 0, 1).unsqueeze(0)
RuntimeError: shape '[68, 91, 768]' is invalid for input of size 768
Here, I use the full version of scannet, for each color image come with 1296*968 pixels, the dino model I use is dinov2_vitb14_pretrain.pth according to the original code.
Am I using the wrong input? Please let me know, thank you.
When I tried to run
project_feature.py, I encountered problem at :Part2Object/pseudo_mask_gen/project_feature.py
Line 183 in 65d728e
I print the shape of
dinov2_vitb14(color), it seem to be[1, 768], which can't be reshape to[68, 91, 768]:Here, I use the full version of scannet, for each color image come with
1296*968pixels, the dino model I use isdinov2_vitb14_pretrain.pthaccording to the original code.Am I using the wrong input? Please let me know, thank you.