@joonson @amirjamaludin
Hi, Thank you for your contribution on this work.
I'm interested in this work, and I'd like to reproduce it in Tensorflow.
In section 2 of your paper, you said that 37.7 hours, 678,389 samples from both VoxCeleb and LRW datasets are used for training.
When preparing data, I found videos far more than 37.7 hours.
So, how did you filter the dataset to 37.7 hours?
Besides, how many hours did you use for training?
Thank you!
I'm looking forward to your reply!
@joonson @amirjamaludin
Hi, Thank you for your contribution on this work.
I'm interested in this work, and I'd like to reproduce it in Tensorflow.
In section 2 of your paper, you said that 37.7 hours, 678,389 samples from both VoxCeleb and LRW datasets are used for training.
When preparing data, I found videos far more than 37.7 hours.
So, how did you filter the dataset to 37.7 hours?
Besides, how many hours did you use for training?
Thank you!
I'm looking forward to your reply!