
Noise suppression fine tune #246

Open · aaronhsueh0506 opened this issue Jan 31, 2023 · 8 comments
Labels: enhancement (New feature or request)

aaronhsueh0506 commented Jan 31, 2023
Hi Rikorose,

I'm trying to fine-tune a few aspects of the model. Do you have any suggestions on these points?

  1. In harsh environments (lower SNR): since the dataset only covers -5 to 45 dB SNR, the resulting spectrogram has little energy above 5 kHz. Can this be improved?
  2. I want to enhance the 8 kHz to 14 kHz range and increase the brightness of the human voice. Can this be done through post-processing (see the sketch after this list)?
  3. In PercepNet, a global gain is added alongside the warped (band) gains. Do we need to do the same thing here?
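For point 2, here is a minimal post-processing sketch of the kind of high-band brightening I have in mind (the scipy filter, cutoff frequencies, and gain are illustrative assumptions, not anything taken from DeepFilterNet):

    import numpy as np
    from scipy.signal import butter, sosfilt

    def brighten(audio, sr=48000, lo=8000.0, hi=14000.0, gain_db=3.0):
        # Add a band-passed copy of the signal back in, which boosts roughly
        # the 8-14 kHz range by about gain_db. All parameters are illustrative.
        sos = butter(4, [lo, hi], btype="bandpass", fs=sr, output="sos")
        boost = 10 ** (gain_db / 20.0) - 1.0
        return audio + boost * sosfilt(sos, audio)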

Thanks,
Aaron

@Rikorose added the enhancement (New feature or request) label on Feb 3, 2023
Rikorose (Owner) commented Feb 3, 2023

  1. Most probably. One could think about adding a connection from a later stage of the DF decoder to the ERB decoder (see the toy sketch after this list). However, this would no longer allow running only the ERB decoder without the DF decoder.
  2. This idea is partly implemented within the air absorption distortion, but it has not been properly tested.
  3. Not sure what you mean here. Which formula are you referring to?
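Regarding 1, a toy sketch of what such a connection could look like (module names, shapes, and the 1x1 convolutions below are purely illustrative, not the actual DeepFilterNet modules):

    import torch
    import torch.nn as nn

    class ToyDecoders(nn.Module):
        # Illustrative only: project a later DF-decoder feature map and add it
        # to the ERB-decoder input. This couples the decoders, so the ERB
        # decoder can no longer run without the DF decoder.
        def __init__(self, df_ch=64, erb_ch=64):
            super().__init__()
            self.df_stage = nn.Conv2d(df_ch, df_ch, 1)
            self.df_to_erb = nn.Conv2d(df_ch, erb_ch, 1)  # the added connection
            self.erb_stage = nn.Conv2d(erb_ch, erb_ch, 1)

        def forward(self, df_feat, erb_feat):
            # df_feat, erb_feat: (batch, channels, time, freq); same shape assumed here
            df_out = self.df_stage(df_feat)
            erb_out = self.erb_stage(erb_feat + self.df_to_erb(df_out))
            return df_out, erb_out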

aaronhsueh0506 (Author) commented Mar 2, 2023

Hi,

I found that the results are good when using your website.
I re-trained the model in Keras, but Keras does not support grouped Conv2DTranspose layers.
I will try to figure out the difference between Keras and Torch.
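One workaround I might try (a sketch, untested, with illustrative parameters) is to emulate a grouped Conv2DTranspose in Keras by splitting the channel axis, applying a separate Conv2DTranspose per group, and concatenating:

    import tensorflow as tf
    from tensorflow.keras import layers

    def grouped_conv2d_transpose(x, filters, kernel_size, strides, groups):
        # Split channels into `groups`, run one Conv2DTranspose per group,
        # then concatenate the outputs along the channel axis.
        splits = tf.split(x, groups, axis=-1)
        outs = [
            layers.Conv2DTranspose(filters // groups, kernel_size,
                                   strides=strides, padding="same")(s)
            for s in splits
        ]
        return layers.Concatenate(axis=-1)(outs)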

Best regards,
Aaron

aaronhsueh0506 (Author) commented Mar 7, 2023

Hi,

I am checking the model inputs and found some differences.
Using numpy.fft.rfft, a Vorbis window, and stft_norm, I can reproduce the values of the torch.stft call below.

    stft_norm = 1 / (n_fft ** 2 / (2 * hop))
    spec = torch.stft(
        audio, n_fft=n_fft, hop_length=hop, window=torch.Tensor(vorbis_window(n_fft)),
        return_complex=True, normalized=False, center=False,
    ).transpose(1, 2)
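For reference, a rough NumPy sketch of that cross-check (the explicit vorbis_window formula and the frame layout here are my assumptions):

    import numpy as np

    def vorbis_window(n_fft):
        # Vorbis ("sine-of-sine") window: sin(pi/2 * sin^2(pi * (n + 0.5) / N))
        n = np.arange(n_fft)
        return np.sin(0.5 * np.pi * np.sin(np.pi * (n + 0.5) / n_fft) ** 2)

    def stft_numpy(audio, n_fft, hop):
        # Frame without centering/padding, window, rfft, then apply stft_norm.
        win = vorbis_window(n_fft)
        stft_norm = 1 / (n_fft ** 2 / (2 * hop))
        n_frames = 1 + (len(audio) - n_fft) // hop
        frames = np.stack([audio[i * hop:i * hop + n_fft] * win
                           for i in range(n_frames)])
        return np.fft.rfft(frames, axis=-1) * stft_norm  # (n_frames, n_fft // 2 + 1)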

But when I send the same signal to df.analysis or df_features in enhance.py, I get a different spec than from the torch.stft call above.
Is there any difference?

Another question: is the dB rescaling important for the ERB features?

Thanks,

Rikorose (Owner) commented Mar 7, 2023

The code looks good; I'm not sure where the differences come from. dB scaling is important since raw amplitude does not correlate well with human loudness perception and is thus not a good feature.
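As a rough sketch of what such a dB feature looks like (the epsilon floor, the dB range, and the normalization to [0, 1] are illustrative choices, not the exact values used in DeepFilterNet):

    import numpy as np

    def erb_db_feature(erb_power, eps=1e-10, db_min=-80.0, db_max=0.0):
        # Compress ERB-band power to dB and map it to roughly [0, 1] so the
        # feature follows perceived loudness rather than raw amplitude.
        feat_db = 10.0 * np.log10(erb_power + eps)
        feat_db = np.clip(feat_db, db_min, db_max)
        return (feat_db - db_min) / (db_max - db_min)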

aaronhsueh0506 (Author) commented Mar 7, 2023

Hi,

I tried this command in enhance.py:

    spec, erb_feat, spec_feat = df_features(audio, df_state, device=get_device())

and saved spec as an npy file.

I also used

    spec = torch.stft(
        audio, n_fft=n_fft, hop_length=hop, window=torch.Tensor(vorbis_window(n_fft)),
        return_complex=True, normalized=False, center=False,
    ).transpose(1, 2) * stft_norm

But these two approaches give different values for spec.
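A small sketch of the comparison I am doing (the .npy filenames are hypothetical):

    import numpy as np

    # spec saved from df_features vs. from the torch.stft call above
    spec_df = np.squeeze(np.load("spec_df_features.npy"))
    spec_stft = np.squeeze(np.load("spec_torch_stft.npy"))

    print("shapes:", spec_df.shape, spec_stft.shape)
    print("max abs diff:", np.abs(spec_df - spec_stft).max())
    # A near-constant ratio would point to a missing normalization factor
    print("ratio sample:", spec_df.flat[:5] / (spec_stft.flat[:5] + 1e-12))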

DonkeyHang commented
(Quoted aaronhsueh0506's comment above comparing the spec from df_features with the spec from the torch.stft call.)

I have the same question. Did you figure out the answer?

In streaming mode, each processing step only has 480 samples (48 kHz sample rate, i.e. 10 ms of data). Even with a 480-sample delay and 480 samples of overlap, neither a Vorbis window with np.fft nor torch.fft gives the same result as the spec from df.analysis.
It confuses me...
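For reference, the streaming framing I mean looks roughly like this (assuming n_fft = 960 and hop = 480, i.e. a 20 ms frame with 50% overlap at 48 kHz):

    import numpy as np

    n_fft, hop = 960, 480  # 20 ms frame, 10 ms hop at 48 kHz (assumed)
    frame_buf = np.zeros(n_fft)

    def push_block(block, window):
        # Append the new 480-sample block to the rolling 960-sample frame,
        # window it, and return the spectrum of that frame.
        global frame_buf
        assert len(block) == hop
        frame_buf = np.concatenate([frame_buf[hop:], block])
        return np.fft.rfft(frame_buf * window)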

aaronhsueh0506 (Author) commented
Hi,

The FFT in torch multiplies by n_fft ** -0.5 when normalized is True, and the inverse FFT undoes it.
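A quick way to see that factor (a sketch; the Hann window here is just for the check, not the model's window):

    import torch

    n_fft, hop = 960, 480
    audio = torch.randn(1, 48000)
    win = torch.hann_window(n_fft)  # any window works for this check

    s_plain = torch.stft(audio, n_fft, hop_length=hop, window=win,
                         return_complex=True, center=False)
    s_norm = torch.stft(audio, n_fft, hop_length=hop, window=win,
                        return_complex=True, center=False, normalized=True)
    # normalized=True scales the result by n_fft ** -0.5 (frame-length normalization)
    print(torch.allclose(s_norm, s_plain * n_fft ** -0.5, atol=1e-5))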

qiuqiu-879 commented
Hello. I would like to know how the df.analysis function used in df_features works. I directly compute

    spec = torch.stft(
        audio, n_fft=n_fft, hop_length=hop, window=torch.Tensor(vorbis_window(n_fft)),
        return_complex=True, normalized=False, center=False,
    ).transpose(1, 2)

but the result I get is different from that of df.analysis. Could you please explain the possible reasons? Thank you.
