Skip to content

cannot reproduce the F1-score reported in the paper. #1

@calx-git

Description

@calx-git

I have run the inference script and tried to reproduce the results reported in the paper. However, after I tried the checkpoints of PGL-Sum and VASNet, I found that the output results were not consistent with the ones reported in the paper. The results I got after running the inference script are:

-----------------------------------------------------------
TEST RESULT on ckpts/vasnet/vasnet1_best_f1.pkl:
TEST MRHiSum F-score 40.968 | MAP50 0.58994 | MAP15 0.25663
-----------------------------------------------------------
-----------------------------------------------------------
TEST RESULT on ckpts/pgl_sum/pgl_sum3_best_f1.pkl:                                                                                                                                 
TEST MRHiSum F-score 41.527 | MAP50 0.6173 | MAP15 0.27549
-----------------------------------------------------------

While the results reported in the paper are:
PGL-SUM 55.89 ±0.04 (F1-Score) 61.60 ±0.14 (mAP-50%) 27.45 ±0.15 (mAP-15%)
VASNet 55.26 ±0.05 (F1-Score) 58.69 ±0.30 (mAP-50%) 25.28 ±0.40 (mAP-15%)

Would you please help check what the problem is?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions