Hi there:
Great work, the opensource models have great results.
Two questions:
1. I see that you decided to replace 32 erb bands with 480 magnitude bands. Is it because erb bands do not work out very well?
2. Regarding fine-tuning, deepfilternet models definitely suffers from the long-silence issue. And I am looking to recreate it for deepfilternet.
My question, do you just extend the augmentation to 30-40 seconds, or implement special implementation for 30-40 seconds for fine-tuning.
And do you have to freeze some of the models, and fine-tune the grus only?
Thanks!
Hi there:
Great work, the opensource models have great results.
Two questions:
1. I see that you decided to replace 32 erb bands with 480 magnitude bands. Is it because erb bands do not work out very well?
2. Regarding fine-tuning, deepfilternet models definitely suffers from the long-silence issue. And I am looking to recreate it for deepfilternet.
My question, do you just extend the augmentation to 30-40 seconds, or implement special implementation for 30-40 seconds for fine-tuning.
And do you have to freeze some of the models, and fine-tune the grus only?
Thanks!