[NVC-Net] About 16 kHz training and model convergence #64

Aria-K-Alethia · 2022-12-08T19:33:02Z

Hi,

Thank you for sharing your great work!

I'm using nvcnet to train a Japanese voice conversion model, I have two questions.

First, I try to adapt your code to 16 kHz wavs, I did the following two manipulations:

changed sr in hparams.py from 22050 into 16000
changed segment_length in hparams.py from 32768 into 16384
The training goes well but the performance is bad even after 400 epochs.

I wonder if you have any idea on training nvcnet on 16 kHz wavs? Do I need any other modifications to ensure the training will go well ?

Second, could you share the value of g_loss_rec when the model converges?.
In my training the g_loss_rec converged to around 0.9 to 1.2, I'm not sure if this is what I should expect in model convergence.

The text was updated successfully, but these errors were encountered:

bacnguyencong-sony · 2022-12-12T13:56:19Z

Hi,
we haven't tried to train NVC-Net on 16 kHz. Under the current hyper-parameters, g_loss_rec should be around 1.2 to1.3
In your case, it could be that other weighing terms are not appropriate for 16 kHz.

Aria-K-Alethia · 2022-12-12T14:07:31Z

Hi,

Thank you for your answering!
Do you mean that the converged g_loss_rec value is normal in my case?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NVC-Net] About 16 kHz training and model convergence #64

[NVC-Net] About 16 kHz training and model convergence #64

Aria-K-Alethia commented Dec 8, 2022

bacnguyencong-sony commented Dec 12, 2022

Aria-K-Alethia commented Dec 12, 2022

[NVC-Net] About 16 kHz training and model convergence #64

[NVC-Net] About 16 kHz training and model convergence #64

Comments

Aria-K-Alethia commented Dec 8, 2022

bacnguyencong-sony commented Dec 12, 2022

Aria-K-Alethia commented Dec 12, 2022