Hello, I'm having some problems with arctic/vc1/. I used the code and parameters you provided for training, and used the same dataset without modifying any parameters, only reducing the batch_size(100->64). But in the end, some of the converted wav files have the problem of repetitive generation, i.e. the model doesn't seem to stop generating the sequence at the right frame, but continues to generate until the maximum length of the sequence. In those problematic wav files, the probs plot shows a probability of almost 0 or less than the threshold (0.5).
This happens no matter if I use TTS_aept to fine tune the vc model or train the vc model from scratch.
Please tell me what I am doing wrong or if this is normal, thank you very much!
By the way, the BCEWithLogitsLoss during training quickly approaches 0, around 0.1 at about 190steps, and then the Loss is around 0.00xx until the end of training.
BCE_Loss=0.0000 and L1_Loss=0.2636 at 50000steps.
Thank you!
Hello, I'm having some problems with arctic/vc1/. I used the code and parameters you provided for training, and used the same dataset without modifying any parameters, only reducing the batch_size(100->64). But in the end, some of the converted wav files have the problem of repetitive generation, i.e. the model doesn't seem to stop generating the sequence at the right frame, but continues to generate until the maximum length of the sequence. In those problematic wav files, the probs plot shows a probability of almost 0 or less than the threshold (0.5).
This happens no matter if I use TTS_aept to fine tune the vc model or train the vc model from scratch.
Please tell me what I am doing wrong or if this is normal, thank you very much!
By the way, the BCEWithLogitsLoss during training quickly approaches 0, around 0.1 at about 190steps, and then the Loss is around 0.00xx until the end of training.
BCE_Loss=0.0000 and L1_Loss=0.2636 at 50000steps.
Thank you!