This repository contains the datasets list and scripts required for the ConferencingSpeech challenge. For more details about the challenge, please see our website.
-
baseline, this folder contains baseline system include inference model exported by onnx and inference scripts; -
eval, this folder contains evaluation scripts to calculate PESQ, STOI and SI-SNR; -
selected_lists, the selected wave about train speech and noise wave name from aishell-1, aishell-3, librispeech-360, VCTK, MUSAN, Audioset. Each participant is only allowed to use the selected speech and noise data below :selected_lists/dev/circle.namecircle RIR wave utt name of dev setselected_lists/dev/linear.namelinear RIR wave utt name of dev setselected_lists/dev/non_uniform.namenon uniform linear RIR wave utt name of dev setselected_lists/dev/clean.namewave utt name of dev set used clean setselected_lists/dev/noise.namewave utt name of dev set used noise setselected_lists/train/aishell_1.namewave utt name from aishell-1 set used in train setselected_lists/train/aishell_3.namewave utt name from aishell-3 set used in train setselected_lists/train/librispeech_360.namewave utt name from librispeech-360 set used in train setselected_lists/train/vctk.namewave utt name from VCTK set used in train setselected_lists/train/audioset.namewave utt name from Audioset used in train setselected_lists/train/musan.namewave utt name from MUSAN used in train setselected_lists/train/circle.namecircle wave RIR name of train setselected_lists/train/linear.namelinear wave RIR name of train setselected_lists/train/non_uniform.namenon unifrom linear RIR utt name of train set
-
simulation, about simulation scripts, how to use to see ReadMesimulation/mix_wav.pysimulate dev set and train setsimulation/prepare.shuseselected_lists/*/*nameto select used wave from downloaded raw data, or you can select them by yourself scripts.simulation/quick_select.pyquickly select the name by a name list instead ofgrep -r -fsimulation/challenge_rirgenerator.pythe script to simulate RIRs in train and dev setsimulation/data/dev_circle_simu_mix.configdev circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/dev_linear_simu_mix.configdev linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/dev_non_uniform_linear_simu_mix.configdev non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start pointsimulation/data/train_simu_circle.configtrain circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.simulation/data/train_simu_linear.configtrain linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.simulation/data/train_simu_non_uniform.configtrain non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.
-
requirements.txt, dependency
Notes:
1. \*.config file should be replaced with correct path of audio files.
2. Training config files have been released together with challenge data.
If you use this challenge dataset and baseline system in a publication, please cite the following paper:
@article{wei2021interspeech,
title={{INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing}},
author={Wei Rao and Yihui Fu and Yanxin Hu and Xin Xu and Yvkai Jv and Jiangyu Han and Zhongjie Jiang and Lei Xie and Yannan Wang and Shinji Watanabe and Zheng-Hua Tan and Hui Bu and Tao Yu and Shidong Shang},
journal={arXiv preprint arXiv:2104.00960}
}
python3.6 or above
pip install -r requirements.txtif you simulation RIRs by yourself with our scripts, you may better install this: