Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data generation code #2

Open
pratheeksk opened this issue Mar 13, 2019 · 4 comments
Open

Data generation code #2

pratheeksk opened this issue Mar 13, 2019 · 4 comments

Comments

@pratheeksk
Copy link

hi,
I wanted to try out the code, how can I generate the data, is the code for data generation also uploaded to GitHub

@jnoelvictorino
Copy link

Hi,

I am also interested on how the mixing / data generation is done.

@divyeshrajpura4114
Copy link

Authors have adopted method from Single-Channel Multi-Speaker Separation using Deep Clustering to generate mixture of data. You can get script over here Deep Clustering. I hope this would help you.

@xin-h963
Copy link

@divyeshrajpura4114
Thank you for the link!
I can generate mixture of data but how can I get input feature, wiener-filter like mask, ideal binary mask, weight threshold matrix? (which is needed in data_utils.py)
I used TIMIT dataset.

@divyeshrajpura4114
Copy link

@xin-h963 T-F mask is just ratio of spectrograms of different speakers present in mixture. Please read the literature about Time-Frequency Mask. Below are some suggetion,

  1. Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design.
  2. On the Ideal Ratio Mask as the Goal of Computational Auditory Scene Analysis.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants