POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise

Kosuke Akimoto, Seng Pei Liew, Sakiko Mishima, Ryo Mizushima, Kong Aik Lee


We present a new database of voice recordings with the goal of promoting research on protection of automatic speaker verification systems from voice spoofing, such as replay attacks. Specifically, we focus on the liveness feature of live speech, i.e., pop noise, and the corresponding voice recordings without this feature, for the purpose of combating spoofing via liveness detection. Our database includes simultaneous recordings using a microphone array, as well as recordings at various distances and positions. To the best of our knowledge, this is the first publicly available database that has been particularly designed to study the liveness features of voice recordings under various conditions.


DOI: 10.21437/Interspeech.2020-1243

Cite as: Akimoto, K., Liew, S.P., Mishima, S., Mizushima, R., Lee, K.A. (2020) POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise. Proc. Interspeech 2020, 1081-1085, DOI: 10.21437/Interspeech.2020-1243.


@inproceedings{Akimoto2020,
  author={Kosuke Akimoto and Seng Pei Liew and Sakiko Mishima and Ryo Mizushima and Kong Aik Lee},
  title={{POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={1081--1085},
  doi={10.21437/Interspeech.2020-1243},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1243}
}