Estimation of Hypernasality Scores from Cleft Lip and Palate Speech

Vikram C M, Ayush Tripathi, Sishir Kalita, S R Mahadeva Prasanna

Hypernasality refers to the perception of excessive nasal resonances in vowels and voiced consonants. Existing speech-processing approaches only classify speech as normal or hypernasal; they do not report the degree of hypernasality as a continuous value, as a nasometer does. Motivated by the functionality of the nasometer, this work proposes a method for evaluating hypernasality. Speech signals representing the two opposite extremes of nasality are used to develop the acoustic models: oral sentences (rich in vowels, stops, and fricatives) from normal speakers and nasal sentences (rich in nasals and nasalized vowels) from moderate-to-severe hypernasal speakers represent the groups with the minimum and maximum attainable degrees of nasality, respectively. Acoustic features derived from glottal activity regions are used to model the minimum and maximum nasality classes with Gaussian mixture model and deep neural network approaches. The posterior probabilities obtained for the nasal sentence class are referred to as hypernasality scores. The scores show a significant correlation (p<0.01) with the perceptual ratings of hypernasality provided by expert speech-language pathologists. Further, the hypernasality scores are used to detect hypernasality, and the results are compared with the nasometer-based approach.
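
The scoring idea described above can be illustrated with a minimal sketch. It is not the authors' implementation. A single diagonal Gaussian per class stands in for the Gaussian mixture model and deep neural network, the feature frames are synthetic, equal class priors are assumed, and averaging frame-level posteriors into one utterance score is also an assumption. All function and variable names are hypothetical.

```python
import math
import random

def fit_diag_gaussian(frames):
    """Estimate per-dimension mean and variance from a list of feature vectors."""
    n, dim = len(frames), len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    variances = [
        max(sum((f[d] - means[d]) ** 2 for f in frames) / n, 1e-6)  # variance floor
        for d in range(dim)
    ]
    return means, variances

def log_likelihood(frame, model):
    """Log density of one frame under a diagonal Gaussian."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
        for x, m, v in zip(frame, means, variances)
    )

def nasal_posterior(frame, oral_model, nasal_model):
    """P(nasal class | frame), assuming equal priors for the two classes."""
    diff = log_likelihood(frame, oral_model) - log_likelihood(frame, nasal_model)
    # Posterior = 1 / (1 + exp(diff)), computed in an overflow-safe form.
    if diff > 0:
        e = math.exp(-diff)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(diff))

def hypernasality_score(frames, oral_model, nasal_model):
    """Average frame-level posterior of the nasal class (assumed pooling rule)."""
    return sum(nasal_posterior(f, oral_model, nasal_model) for f in frames) / len(frames)

def synthetic_frames(center, count, rng, dim=2):
    """Synthetic stand-in for acoustic features; not real speech data."""
    return [[rng.gauss(center, 1.0) for _ in range(dim)] for _ in range(count)]

rng = random.Random(0)
# Minimum-nasality class: oral sentences of normal speakers (synthetic here).
oral_model = fit_diag_gaussian(synthetic_frames(0.0, 500, rng))
# Maximum-nasality class: nasal sentences of hypernasal speakers (synthetic here).
nasal_model = fit_diag_gaussian(synthetic_frames(3.0, 500, rng))

# Score two unseen synthetic utterances, one drawn near each extreme.
score_oral = hypernasality_score(synthetic_frames(0.0, 200, rng), oral_model, nasal_model)
score_nasal = hypernasality_score(synthetic_frames(3.0, 200, rng), oral_model, nasal_model)
print(f"oral-like utterance: {score_oral:.3f}, nasal-like utterance: {score_nasal:.3f}")
```

Because the score is a posterior probability, it stays between 0 and 1, which is what allows it to be read as a continuous degree of nasality rather than a binary label.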

 DOI: 10.21437/Interspeech.2018-1631

Cite as: M, V.C., Tripathi, A., Kalita, S., Prasanna, S.R.M. (2018) Estimation of Hypernasality Scores from Cleft Lip and Palate Speech. Proc. Interspeech 2018, 1701-1705, DOI: 10.21437/Interspeech.2018-1631.

  author={Vikram C M and Ayush Tripathi and Sishir Kalita and S R Mahadeva Prasanna},
  title={Estimation of Hypernasality Scores from Cleft Lip and Palate Speech},
  booktitle={Proc. Interspeech 2018},