shape error

Hi, When I run the demo_maxvqa.py for a test, something is wrong with the shape:

Traceback (most recent call last):
  File "E:/MaxVQA-master/demo_maxvqa.py", line 167, in <module>
    a = inference(video)
  File "E:/MaxVQA-master/demo_maxvqa.py", line 160, in inference
    vis_feats = visual_encoder(data["aesthetic"].to(device), data["technical"].to(device))
  File "D:\tools\Anaconda\set\envs\python37tf\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
    return forward_call(*input, **kwargs)
  File "E:\MaxVQA-master\model\visual.py", line 19, in forward
    clip_feats = clip_feats[1:].reshape(7,7,-1,1024).permute(3,2,0,1)
RuntimeError: shape '[7, 7, -1, 1024]' is invalid for input of size 64512


vis_feats = visual_encoder(data["aesthetic"].to(device), data["technical"].to(device))
data["aesthetic"]---[3, 64, 224, 224]
data["technical"]---[3, 128, 224, 224]

The specific problem is found in the following two lines of code
clip_feats = self.clip_visual(x_aes)
clip_feats = clip_feats[1:].reshape(7,7,-1,1024).permute(3,2,0,1)
However, the shape of clip_feats is [64, 1024]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

shape error #7

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

shape error #7

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions