Data generation code #2

pratheeksk · 2019-03-13T06:21:24Z

Hi,
I wanted to try out the code. How can I generate the data? Is the data generation code also uploaded to GitHub?

jnoelvictorino · 2019-11-16T14:21:41Z

Hi,

I am also interested in how the mixing / data generation is done.

divyeshrajpura4114 · 2020-01-16T20:16:38Z

The authors adopted the method from "Single-Channel Multi-Speaker Separation using Deep Clustering" to generate the mixtures. You can get the script here: Deep Clustering. I hope this helps.
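For readers who only need the general idea of two-speaker mixing, here is a minimal sketch in Python. It is not the script referenced above: the function name `mix_at_snr`, the truncation to the shorter utterance, and the peak rescaling are all assumptions made for illustration. It scales the second signal so that the first is `snr_db` decibels louder, then sums the two.

```python
import numpy as np

def mix_at_snr(s1, s2, snr_db):
    """Mix two mono signals so that s1 is snr_db dB louder than s2.

    Hypothetical helper for illustration; returns the mixture and the
    two (rescaled) sources, all truncated to the shorter input length.
    """
    n = min(len(s1), len(s2))
    s1 = np.asarray(s1[:n], dtype=np.float64)
    s2 = np.asarray(s2[:n], dtype=np.float64)

    p1 = np.mean(s1 ** 2)  # average power of speaker 1
    p2 = np.mean(s2 ** 2)  # average power of speaker 2
    if p1 == 0 or p2 == 0:
        raise ValueError("both signals must be non-silent")

    # Choose gain g so that 10*log10(p1 / (g^2 * p2)) == snr_db.
    gain = np.sqrt(p1 / (p2 * 10.0 ** (snr_db / 10.0)))
    s2 = gain * s2
    mix = s1 + s2

    # Rescale everything by the same factor if the mixture would clip.
    peak = np.max(np.abs(mix))
    if peak > 1.0:
        s1, s2, mix = s1 / peak, s2 / peak, mix / peak
    return mix, s1, s2
```

Keeping the rescaled sources alongside the mixture matters because the training targets (masks) are later computed from those same sources.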

xin-h963 · 2020-02-13T06:34:14Z

@divyeshrajpura4114
Thank you for the link!
I can generate the mixtures, but how can I get the input feature, Wiener-filter-like mask, ideal binary mask, and weight threshold matrix (which are needed in data_utils.py)?
I used the TIMIT dataset.

divyeshrajpura4114 · 2020-02-13T08:22:29Z

@xin-h963 A T-F mask is just a ratio of the spectrograms of the different speakers present in the mixture. Please read the literature on time-frequency masking. Below are some suggestions:

Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design.
On the Ideal Ratio Mask as the Goal of Computational Auditory Scene Analysis.
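As a rough illustration of the quantities asked about above, the following Python sketch computes a log-magnitude input feature, an ideal ratio mask, an ideal binary mask, a Wiener-filter-like mask, and a silence-weighting matrix from two equal-length source signals. The names `stft_mag` and `compute_targets`, the STFT parameters, the 40 dB threshold, and the interpretation of the "weight threshold matrix" as a mask over non-silent bins are all assumptions; the exact format expected by data_utils.py is not stated in this thread.

```python
import numpy as np

def stft_mag(x, n_fft=256, hop=64):
    """Magnitude spectrogram, shape (freq, time), using a Hann window."""
    x = np.asarray(x, dtype=np.float64)
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop:i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)).T

def compute_targets(s1, s2, threshold_db=40.0, eps=1e-8):
    """Hypothetical helper: features and masks for speaker 1 of a 2-speaker mix."""
    mag1, mag2 = stft_mag(s1), stft_mag(s2)
    mix_mag = stft_mag(np.asarray(s1) + np.asarray(s2))
    log_mix = 20.0 * np.log10(mix_mag + eps)  # mixture magnitude in dB
    return {
        # Network input: log-magnitude spectrogram of the mixture.
        "feature": log_mix,
        # Ideal ratio mask: speaker 1 magnitude over the sum of magnitudes.
        "irm": mag1 / (mag1 + mag2 + eps),
        # Ideal binary mask: 1 where speaker 1 dominates the bin.
        "ibm": (mag1 > mag2).astype(np.float32),
        # Wiener-filter-like mask: ratio of power spectrograms.
        "wiener": mag1 ** 2 / (mag1 ** 2 + mag2 ** 2 + eps),
        # Assumed weighting: 1 for bins within threshold_db of the loudest bin.
        "weight": (log_mix > log_mix.max() - threshold_db).astype(np.float32),
    }
```

The masks for the second speaker follow by swapping the two arguments. With two speakers, the two ratio masks sum to approximately one in every bin.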
