in line 57 of masked_huber_loss.lua, it says 1 is for impossible features. it is actually 0 for impossible features. So line 65 should actually be `(batch_size * feature_size) / self.mask_sum:sum()` Lines 58-60 should also be changed