-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CQT2010 problematic output #85
Comments
Hi! I would like to contribute. Do you think the problem is in |
@migperfer Thanks for your interest in contributing! I think the problem is from the downsampling functions inside As for padding, there is a |
Aaahh I see. Thanks! But there is no Anyway, I think we can add more details to the documentation, so to specify that when |
Yes, there is no I am still thinking if there is a case where Consider the following case, where we have an audio clip with 5 time steps When So, it seems the |
Yes, sorry I wasn't thinking clearly 😅 |
But anyhow, I would appreciate that if you are willing to fix the downsampling issue in |
Hi! I think it's solved now. I have a few questions though:
|
Thanks! I am reviewing your pull request and it seems I simply messed up the sin and cos kernels.
I will be running the unit test and if everything is okay, I will accept your pull request #94. For the two points above, maybe you can make another pull request later? |
Sure! Number 1 is already done, let me experiment with how can we do number 2 without breaking things that are working now. |
Ok. I did point 2 but I'm couldn't try it for float16, because PyTorch's 2D float16 convolutions don't work on CPU, and I don't have a GPU on my personal laptop. In case you have a GPU, would you mind if I do the pull request and you try it on yours? It is working for float64, but just to double-check. The code I used for float16/half-precision:
The code I used for float64/double-precision:
|
Definitely, a pull request is welcome. I will try it on my GPU later. |
Can you check if It seems I have finished testing your new pull request, everything seems okay so far. The final thing to improve is the scipy import statement. |
Can you double check if the PyTorchLighning runs at 16-bit precision with your modified code? I have just double checked that even with my old code, I can simply do But after your modification, the |
It seems I messed up something when updating nnAudio from
0.1.15
to0.2.0
.The output for
CQT2010
is very different fromCQT2010v2
. I suspect something is wrong during downsampling. But I don't have time to debug at the moment, will post this as an issue to reminder me later. Or if anyone knows the solution to this problem, a pull request is welcome.The following code produces the above-mentioned issue. The code below is using nnAudio 0.2.2
The text was updated successfully, but these errors were encountered: