Skip to content

Question about using test set to select model? #1

@kingofapplehead

Description

@kingofapplehead

Hi,

Thanks for sharing your work and code! I wonder how your final best result is obtained?

I would assume that the results reported in the sota comparison table (TABLE II) are your test set results. And I noticed that that result (89.2 89.7 89.5 78.9) are just the best results in your ablation study in TABLE VIII. So, it seems to me that you relied on the test set to select the best kernel size/fusion to report the best results. Wouldn't this make your results biased and overfit on the test set? I thought that test sets should never be involved in any model selection /hyperparam selection process.

Following this, could you give more details about how the train, val, and test set are used in your model training and hyperparameter tuning? For example, why did you use the val set to select the best learning rate but not other hyperparameters like the kernel size/fusion?

Please excuse me and correct me if I understand your work incorrectly.

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions