Objective Speech Quality Estimation of In-Ear Microphone Speech

João Felipe Santos, Rachel Bouserhal, Jeremie Voix, Tiago Falk


Speech captured from an in-ear microphone (IEM) under an intra-aural device is beneficial in extremely noisy environments as it maintains a relatively high signal to noise ratio. Due to its limited bandwidth, speech enhancement is required in order to obtain a more natural speech. Consequently, quick and practical measurement of speech quality is important. In this paper, we compare the performance of the quality of intrusive and non-intrusive objective quality metrics on IEM speech, and propose an adaptation of a non-intrusive metric, the speech-to-reverberation modulation energy ratio (SRMR) to IEM speech signals. Changes are implemented to take into account the effect of the occluded ear on the recorded speech signals, which causes an amplification in the bone conduction sounds in the ear canal. We show that the updated SRMR metric, SRMRIEM, significantly reduces the performance gap between nonintrusive and intrusive metrics.


 DOI: 10.21437/PQS.2016-15

Cite as: Felipe Santos, J., Bouserhal, R., Voix, J., Falk, T. (2016) Objective Speech Quality Estimation of In-Ear Microphone Speech. Proc. 5th ISCA/DEGA Workshop on Perceptual Quality of Systems (PQS 2016), 69-73, DOI: 10.21437/PQS.2016-15.


@inproceedings{Felipe Santos2016,
  author={João {Felipe Santos} and Rachel Bouserhal and Jeremie Voix and Tiago Falk},
  title={Objective Speech Quality Estimation of In-Ear Microphone Speech},
  year=2016,
  booktitle={Proc. 5th ISCA/DEGA Workshop on Perceptual Quality of Systems (PQS 2016)},
  pages={69--73},
  doi={10.21437/PQS.2016-15},
  url={http://dx.doi.org/10.21437/PQS.2016-15}
}