Development of a Speech Quality Database Under Uncontrolled Conditions

Alessandro Ragano, Emmanouil Benetos, Andrew Hines


Objective audio quality assessment is preferred to avoid time-consuming and costly listening tests. The development of objective quality metrics depends on the availability of datasets appropriate to the application under study. Currently, a suitable human-annotated dataset for developing quality metrics in archive audio is missing. Given the online availability of archival recordings, we propose to develop a real-world audio quality dataset. We present a methodology used to curate a speech quality database using the archive recordings from the Apollo Space Program. The proposed procedure is based on two steps: a pilot listening test and an exploratory data analysis. The pilot listening test shows that we can extract audio clips through the control of speech-to-text performance metrics to prevent data repetition. Through unsupervised exploratory data analysis, we explore the characteristics of the degradations. We classify distinct degradations and we study spectral, intensity, tonality and overall quality properties of the data through clustering techniques. These results provide the necessary foundation to support the subsequent development of large-scale crowdsourced datasets for audio quality.


 DOI: 10.21437/Interspeech.2020-1899

Cite as: Ragano, A., Benetos, E., Hines, A. (2020) Development of a Speech Quality Database Under Uncontrolled Conditions. Proc. Interspeech 2020, 4616-4620, DOI: 10.21437/Interspeech.2020-1899.


@inproceedings{Ragano2020,
  author={Alessandro Ragano and Emmanouil Benetos and Andrew Hines},
  title={{Development of a Speech Quality Database Under Uncontrolled Conditions}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={4616--4620},
  doi={10.21437/Interspeech.2020-1899},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1899}
}