Transformation of voice quality in singing using glottal source features

João P. Cabral, Alexsandro R. Meireles


Glottal activity information can be very important in several speech processing applications, such as in speech therapy, voice disorder diagnosis, voice transformation and text-to-speech synthesis. However, the use of algorithms for estimating glottal parameters from the speech signal is very limited in those applications because of problems with robustness and accuracy. In singing synthesis, the glottal source representation is also very important because it is closely related with the emotions and singing style. This paper proposes a robust method to estimate the voice quality parameters of the glottal source by using both the electroglottographic (EGG) signal and the acoustic recordings of singing voice for five vowels in three different voice qualities: modal, breathy and creaky. The analysis of the resulting measurements permitted to confirm that voice quality parameters of the glottal source are correlated with the type of voice. Moreover, another experiment was conducted to show that it is possible to transform the modal singing voice into breathy and creaky by using an analysis-synthesis method that incorporates a glottal source model.


 DOI: 10.21437/SMM.2019-7

Cite as: Cabral, J.P., Meireles, A.R. (2019) Transformation of voice quality in singing using glottal source features. Proc. SMM19, Workshop on Speech, Music and Mind 2019, 31-35, DOI: 10.21437/SMM.2019-7.


@inproceedings{Cabral2019,
  author={João P. Cabral and Alexsandro R. Meireles},
  title={{Transformation of voice quality in singing using glottal source features}},
  year=2019,
  booktitle={Proc. SMM19, Workshop on Speech, Music and Mind 2019},
  pages={31--35},
  doi={10.21437/SMM.2019-7},
  url={http://dx.doi.org/10.21437/SMM.2019-7}
}