A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning

Chenggang Zhang, Xueliang Zhang


Acoustic echo cancellation (AEC) cancels the acoustic feedback between a loudspeaker and a microphone. Ideally, AEC is a linear problem that can be solved by adaptive filtering. In practice, however, two problems severely degrade AEC performance: 1) double-talk, and 2) nonlinear distortion, caused mainly by loudspeakers and/or power amplifiers. To address both problems, we propose a novel cascaded AEC method that integrates adaptive filtering and deep learning. Specifically, two long short-term memory (LSTM) networks are employed, one for double-talk detection (DTD) and one for nonlinearity modeling, while adaptive filtering removes the linear part of the echo. Experimental results show that the proposed method outperforms conventional methods by a considerable margin on objective evaluation metrics in the matched scenario. Moreover, the proposed method generalizes much better to unmatched scenarios than an end-to-end deep learning method.
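
The linear stage of such a cascade can be illustrated with a minimal sketch. The abstract does not name the adaptive algorithm, so the normalized LMS (NLMS) update below is an assumption, as are the function name, the tap count, and the step size. The `freeze` flag is a hypothetical stand-in for the output of the DTD network: when it is set, the filter keeps subtracting its echo estimate but stops adapting. The LSTM stage for nonlinearity modeling is not sketched here.

```python
import random


def nlms_echo_cancel(far_end, mic, taps=8, mu=0.5, eps=1e-8, freeze=None):
    """Subtract the linear echo of `far_end` from `mic` with an NLMS filter.

    `freeze[n]` is a hypothetical double-talk flag (an assumption, standing
    in for a DTD decision): when True, the weights are not updated at n.
    """
    w = [0.0] * taps      # estimated echo-path impulse response
    buf = [0.0] * taps    # most recent far-end samples, newest first
    out = []
    for n, (x, d) in enumerate(zip(far_end, mic)):
        buf = [x] + buf[:-1]
        echo_hat = sum(wi * bi for wi, bi in zip(w, buf))
        e = d - echo_hat  # residual: near-end signal plus unmodeled echo
        out.append(e)
        if freeze is None or not freeze[n]:
            # Normalized LMS step: scale by the far-end energy in the buffer.
            norm = sum(b * b for b in buf) + eps
            step = mu * e / norm
            w = [wi + step * bi for wi, bi in zip(w, buf)]
    return out, w


# Toy scenario (assumed): purely linear echo path, no near-end talker, no noise.
random.seed(0)
true_path = [0.6, -0.3, 0.1]
far = [random.uniform(-1.0, 1.0) for _ in range(4000)]
mic = [sum(h * far[n - k] for k, h in enumerate(true_path) if n - k >= 0)
       for n in range(len(far))]
residual, weights = nlms_echo_cancel(far, mic)
```

In this toy setting the learned `weights` approach `true_path` and the residual decays toward zero. With a near-end talker or loudspeaker nonlinearity present, a linear filter alone leaves residual echo, which is the gap the two LSTM networks in the proposed cascade are meant to address.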


DOI: 10.21437/Interspeech.2020-1260

Cite as: Zhang, C., Zhang, X. (2020) A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning. Proc. Interspeech 2020, 3940-3944, DOI: 10.21437/Interspeech.2020-1260.


@inproceedings{Zhang2020,
  author={Chenggang Zhang and Xueliang Zhang},
  title={{A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={3940--3944},
  doi={10.21437/Interspeech.2020-1260},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1260}
}