Skip to content

"Deep residual echo suppression with a tunable tradeoff between signal distortion and echo suppression", ICASSP, 2021.

Notifications You must be signed in to change notification settings

AmirIvry-aka-AI/Tunable-Residual-Echo-Suppression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION (Accepted to ICASSP 2021 Conference)

Amir Ivry, Prof. Israel Cohen, Dr. Baruch Berdugo

Andrew and Erna Viterbi Faculty of Electrical and Computer Engineering, Technion - Israel Institute of Technology

This research proposes a residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain. This system embeds a design parameter that allows a tunable tradeoff between the desired-signal distortion and residual echo suppression in double-talk scenarios. The system employs 136 thousand parameters, and requires 1.6 Giga floating-point operations per second and 10 Mega-bytes of memory. The implementation satisfies both the timing requirements of the AEC challenge and the computational and memory limitations of on-device applications. Experiments are conducted with 161 h of data from the AEC challenge database and from real independent recordings. We demonstrate the performance of the proposed system in real-life conditions and compare it with two competing methods regarding echo suppression and desired-signal distortion, generalization to various environments, and robustness to high echo levels
We share the code here for reproducability and it is our hope you will also find it instructive for speech residual echo suppression. You are also encouraged to refer to the more elaborated published paper. Demo can be found here.

Table of Contents

General Information

This code implements a deep learning-based residual echo suppressor that is meant to preserve desired speech and cancel echo in mono acoustic echo cancellation setups. This implementation is computationaly lean, and embeds a training objective function with a dedicated design parameter. This parameter dynamically controls the trade-off between speech distortion and echo suppression that the system exhibits. A pytorch model is provided with a Python-MATLAB API that allows training and inference.

Setup

To prepare for usage, the user should follow these steps:

  • Clone this repo
  • Create a MATLAB project with the following folder leveling, where 'data folder' contains two subfolders - 'train' and 'test':

folder_structure

MATLAB leveling

  • The 'train' folder holds the 'mic.pcm', 'ref.pcm', and 'target.pcm' files. The 'test' folder holds the same without the 'target.pcm'
  • Set up a virtual environment and run: pip install -r requirements.txt

Usage

Open mainScript.m and follow internal MATLAB's documentation on how to insert user parameters and how to employ the PYTHON API. The user will be required to mention the desired scenario (training/testing) and provide relative path to parent data directory. In case of 'train' mode, user will also need to choose statistics to apply on the test set, and existing Pytorch model.

Acknowledgements

This research was supported by the Pazy Research Foundation, the Israel Science Foundation (ISF), and the International Speech Communication Association (ISCA). We would also like to thank stem audio for their technical support.
If you use this repo or other instance of this research, please cite the following:
@inproceedings{ivry2021objective,
title={DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION},
author={Ivry, Amir and Cohen, Israel and Berdugo, Baruch},
booktitle={ICASSP},
year={2021},
organization={IEEE}
}

Contact

Created by Amir Ivry - feel free to contact me also via [email protected].

About

"Deep residual echo suppression with a tunable tradeoff between signal distortion and echo suppression", ICASSP, 2021.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published