Adversarial Audio: A New Information Hiding Method

Yehao Kong, Jiliang Zhang


Audio is an important medium in people’s daily lives, and hidden information can be embedded into audio for covert communication. Current audio information hiding techniques can be roughly classified into time domain-based and transform domain-based techniques. Time domain-based techniques have large hiding capacity but low imperceptibility, whereas transform domain-based techniques have better imperceptibility but poor hiding capacity. This paper proposes a new audio information hiding technique that offers both high hiding capacity and good imperceptibility. The proposed method takes the original audio signal as input and obtains the audio signal embedded with hidden information (called stego audio) through the training of our private DNN-based automatic speech recognition (ASR) model. Experimental results show that the proposed technique achieves a high hiding capacity of 48 cps with good imperceptibility and high security.
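The abstract does not spell out the embedding procedure, so the following is only a minimal sketch of one plausible reading: the cover audio is perturbed by gradient descent until a model known only to sender and receiver decodes the hidden message, in the style of a targeted adversarial example. Everything concrete here is an assumption made for illustration and is not taken from the paper: a fixed random linear classifier stands in for the private DNN-based ASR model, and the names `make_private_model`, `embed`, `decode`, the 27-symbol `ALPHABET`, the one-character-per-frame layout, `FRAME_LEN`, and the step size are all hypothetical.

```python
import numpy as np

ALPHABET = "abcdefghijklmnopqrstuvwxyz "  # hypothetical 27-symbol character set
FRAME_LEN = 1600  # assumption: one hidden character per 100 ms frame at 16 kHz


def make_private_model(seed):
    """Toy stand-in for the private ASR model: a fixed random linear classifier."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((FRAME_LEN, len(ALPHABET))) / np.sqrt(FRAME_LEN)


def decode(audio, weights):
    """Receiver side: classify every frame and map class indices to characters."""
    frames = audio.reshape(-1, FRAME_LEN)
    return "".join(ALPHABET[i] for i in np.argmax(frames @ weights, axis=1))


def embed(cover, message, weights, step=0.05, max_iters=500):
    """Sender side: perturb the cover until the model decodes `message`."""
    if cover.size != len(message) * FRAME_LEN:
        raise ValueError("cover must hold exactly one frame per character")
    # One-hot target for each frame, one row per hidden character.
    targets = np.eye(len(ALPHABET))[[ALPHABET.index(c) for c in message]]
    frames = cover.reshape(-1, FRAME_LEN)
    delta = np.zeros_like(frames)
    for _ in range(max_iters):
        logits = (frames + delta) @ weights
        # Stop as soon as every frame is classified as its target character.
        if np.array_equal(np.argmax(logits, axis=1), np.argmax(targets, axis=1)):
            break
        # Softmax probabilities, then the cross-entropy gradient w.r.t. the audio.
        probs = np.exp(logits - logits.max(axis=1, keepdims=True))
        probs /= probs.sum(axis=1, keepdims=True)
        delta -= step * (probs - targets) @ weights.T
    return (frames + delta).reshape(-1)


message = "hello world"
weights = make_private_model(seed=0)  # shared secret between sender and receiver
cover = 0.1 * np.random.default_rng(1).standard_normal(len(message) * FRAME_LEN)
stego = embed(cover, message, weights)

# Signal-to-noise ratio of the cover relative to the embedded perturbation.
snr_db = 10 * np.log10(np.sum(cover**2) / np.sum((stego - cover) ** 2))
print(decode(stego, weights), round(snr_db, 1))
```

Stopping at the first iteration where the decoded text matches keeps the perturbation small, which is the toy analogue of imperceptibility; a real system would need a perceptual constraint and a far stronger model than this linear stand-in.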


DOI: 10.21437/Interspeech.2020-1294

Cite as: Kong, Y., Zhang, J. (2020) Adversarial Audio: A New Information Hiding Method. Proc. Interspeech 2020, 2287-2291, DOI: 10.21437/Interspeech.2020-1294.


@inproceedings{Kong2020,
  author={Yehao Kong and Jiliang Zhang},
  title={{Adversarial Audio: A New Information Hiding Method}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={2287--2291},
  doi={10.21437/Interspeech.2020-1294},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1294}
}