The HWU64 dataset contains 25k samples according to the original paper. The DialoGLUE paper states the same number of samples.
However, the README states 11k samples.
If I count the number of samples which are actually in the HWU64 part of DialoGLUE then I get 12,112 samples (12k).
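For anyone who wants to reproduce the count, here is a minimal sketch of one way to do it. It assumes that each split is a CSV file with a single header row and one sample per row, and that the splits are named `train.csv`, `val.csv` and `test.csv`; the helper `count_samples` and the directory path in the usage comment are hypothetical, so adjust them to the actual layout.

```python
import csv
from pathlib import Path


def count_samples(data_dir, splits=("train.csv", "val.csv", "test.csv")):
    """Count data rows per split file, excluding the header row.

    Assumption: every split is a CSV file with exactly one header line.
    Split files that do not exist in data_dir are skipped.
    """
    counts = {}
    for name in splits:
        path = Path(data_dir) / name
        if not path.exists():
            continue
        with path.open(newline="", encoding="utf-8") as f:
            reader = csv.reader(f)
            next(reader, None)  # skip the assumed header row
            # Ignore completely empty lines so they are not counted as samples.
            counts[name] = sum(1 for row in reader if row)
    return counts


# Usage (hypothetical path):
# counts = count_samples("path/to/hwu")
# print(counts, sum(counts.values()))
```

Using `csv.reader` rather than counting raw lines avoids miscounting utterances that contain quoted commas or embedded line breaks.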
My questions:
- Is there a reason for the difference in numbers in the original HWU64 and in the DialoGLUE HWU64? Or is it a bug?
- Did you compute the performance of the intent prediction models on 25k, 12k or 11k samples?
Thank you for your answers :)