DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION (Accepted to ICASSP 2021 Conference)

Amir Ivry, Prof. Israel Cohen, Dr. Baruch Berdugo

Andrew and Erna Viterbi Faculty of Electrical and Computer Engineering, Technion - Israel Institute of Technology

This research proposes a residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain. This system embeds a design parameter that allows a tunable tradeoff between the desired-signal distortion and residual echo suppression in double-talk scenarios. The system employs 136 thousand parameters, and requires 1.6 Giga floating-point operations per second and 10 Mega-bytes of memory. The implementation satisfies both the timing requirements of the AEC challenge and the computational and memory limitations of on-device applications. Experiments are conducted with 161 h of data from the AEC challenge database and from real independent recordings. We demonstrate the performance of the proposed system in real-life conditions and compare it with two competing methods regarding echo suppression and desired-signal distortion, generalization to various environments, and robustness to high echo levels
We share the code here for reproducability and it is our hope you will also find it instructive for speech residual echo suppression. You are also encouraged to refer to the more elaborated published paper. Demo can be found here.

General Information

This code implements a deep learning-based residual echo suppressor that is meant to preserve desired speech and cancel echo in mono acoustic echo cancellation setups. This implementation is computationaly lean, and embeds a training objective function with a dedicated design parameter. This parameter dynamically controls the trade-off between speech distortion and echo suppression that the system exhibits. A pytorch model is provided with a Python-MATLAB API that allows training and inference.

Setup

To prepare for usage, the user should follow these steps:

Clone this repo
Create a MATLAB project with the following folder leveling, where 'data folder' contains two subfolders - 'train' and 'test':

_{MATLAB leveling}

The 'train' folder holds the 'mic.pcm', 'ref.pcm', and 'target.pcm' files. The 'test' folder holds the same without the 'target.pcm'
Set up a virtual environment and run: pip install -r requirements.txt

Usage

Open mainScript.m and follow internal MATLAB's documentation on how to insert user parameters and how to employ the PYTHON API. The user will be required to mention the desired scenario (training/testing) and provide relative path to parent data directory. In case of 'train' mode, user will also need to choose statistics to apply on the test set, and existing Pytorch model.

Acknowledgements

This research was supported by the Pazy Research Foundation, the Israel Science Foundation (ISF), and the International Speech Communication Association (ISCA). We would also like to thank stem audio for their technical support.
If you use this repo or other instance of this research, please cite the following:
@inproceedings{ivry2021objective,
title={DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION},
author={Ivry, Amir and Cohen, Israel and Berdugo, Baruch},
booktitle={ICASSP},
year={2021},
organization={IEEE}
}

Contact

Created by Amir Ivry - feel free to contact me also via [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Python		Python
STFT		STFT
mainDev		mainDev
npy-matlab		npy-matlab
utils		utils
README.md		README.md
mainScript.m		mainScript.m
model.pt		model.pt
stats.mat		stats.mat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION (Accepted to ICASSP 2021 Conference)

Amir Ivry, Prof. Israel Cohen, Dr. Baruch Berdugo

Andrew and Erna Viterbi Faculty of Electrical and Computer Engineering, Technion - Israel Institute of Technology

Table of Contents

General Information

Setup

Usage

Acknowledgements

Contact

About

Releases

Packages

Languages

AmirIvry-aka-AI/Tunable-Residual-Echo-Suppression

Folders and files

Latest commit

History

Repository files navigation

DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION (Accepted to ICASSP 2021 Conference)

Amir Ivry, Prof. Israel Cohen, Dr. Baruch Berdugo

Andrew and Erna Viterbi Faculty of Electrical and Computer Engineering, Technion - Israel Institute of Technology

Table of Contents

General Information

Setup

Usage

Acknowledgements

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages