Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NVC-Net] About 16 kHz training and model convergence #64

Open
Aria-K-Alethia opened this issue Dec 8, 2022 · 2 comments
Open

[NVC-Net] About 16 kHz training and model convergence #64

Aria-K-Alethia opened this issue Dec 8, 2022 · 2 comments

Comments

@Aria-K-Alethia
Copy link

Hi,

Thank you for sharing your great work!

I'm using nvcnet to train a Japanese voice conversion model, I have two questions.

First, I try to adapt your code to 16 kHz wavs, I did the following two manipulations:

  1. changed sr in hparams.py from 22050 into 16000
  2. changed segment_length in hparams.py from 32768 into 16384
    The training goes well but the performance is bad even after 400 epochs.

I wonder if you have any idea on training nvcnet on 16 kHz wavs? Do I need any other modifications to ensure the training will go well ?

Second, could you share the value of g_loss_rec when the model converges?.
In my training the g_loss_rec converged to around 0.9 to 1.2, I'm not sure if this is what I should expect in model convergence.

@bacnguyencong-sony
Copy link
Contributor

Hi,
we haven't tried to train NVC-Net on 16 kHz. Under the current hyper-parameters, g_loss_rec should be around 1.2 to1.3
In your case, it could be that other weighing terms are not appropriate for 16 kHz.

@Aria-K-Alethia
Copy link
Author

Hi,

Thank you for your answering!
Do you mean that the converged g_loss_rec value is normal in my case?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants