Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification

Xu Li, Na Li, Jinghua Zhong, Xixin Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng


Adversarial attacks on automatic speaker verification (ASV) systems have recently attracted widespread attention because they pose severe threats to these systems. However, methods to defend against such attacks are limited. Existing approaches mainly focus on retraining ASV systems with adversarial data augmentation, and the robustness of countermeasures against different attack settings has been insufficiently investigated. Orthogonal to prior approaches, this work proposes to defend ASV systems against adversarial attacks with a separate detection network, rather than augmenting ASV training with adversarial data. A VGG-like binary classification detector is introduced and shown to be effective at detecting adversarial samples. To investigate detector robustness in a realistic defense scenario where unseen attack settings may exist, we analyze the impact of various kinds of unseen attack settings. We observe that the detector is robust against unseen substitute ASV systems (6.27% EERdet degradation in the worst case), but only weakly robust against unseen perturbation methods (50.37% EERdet degradation in the worst case). This weak robustness against unseen perturbation methods indicates a direction for developing stronger countermeasures.
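The abstract reports detector quality as EERdet without defining it. Assuming it denotes the equal error rate of the detection network (the operating point where the miss rate on adversarial samples equals the false alarm rate on genuine samples), the metric could be approximated as in the sketch below. The function name `equal_error_rate`, the score convention (higher means "more likely adversarial"), and the label encoding are illustrative assumptions, not taken from the paper.

```python
def equal_error_rate(scores, labels):
    """Approximate the equal error rate (EER) of a binary detector.

    Assumed conventions (hypothetical, not from the paper):
      scores: detector outputs, higher = more likely adversarial
      labels: 1 for adversarial samples, 0 for genuine samples
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")

    # Candidate thresholds: every observed score, plus one above all scores.
    thresholds = sorted(set(scores)) + [float("inf")]

    best_gap, eer = None, None
    for t in thresholds:
        # Miss rate: adversarial samples scored below the threshold.
        fnr = sum(s < t for s in pos) / len(pos)
        # False alarm rate: genuine samples scored at or above the threshold.
        fpr = sum(s >= t for s in neg) / len(neg)
        gap = abs(fnr - fpr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (fnr + fpr) / 2
    return eer


# Toy example: two genuine and two adversarial samples with overlapping scores.
toy_eer = equal_error_rate([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])
```

Under this reading, a detector with perfectly separated scores has an EER of 0, and the degradation figures quoted in the abstract would describe how much this value worsens when the detector faces attack settings it was not trained on.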


DOI: 10.21437/Interspeech.2020-2441

Cite as: Li, X., Li, N., Zhong, J., Wu, X., Liu, X., Su, D., Yu, D., Meng, H. (2020) Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification. Proc. Interspeech 2020, 1540-1544, DOI: 10.21437/Interspeech.2020-2441.


@inproceedings{Li2020,
  author={Xu Li and Na Li and Jinghua Zhong and Xixin Wu and Xunying Liu and Dan Su and Dong Yu and Helen Meng},
  title={{Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={1540--1544},
  doi={10.21437/Interspeech.2020-2441},
  url={http://dx.doi.org/10.21437/Interspeech.2020-2441}
}