This paper investigates the effects of phoneme errors in the adaptation data of HMM-based speech synthesis. Phoneme errors are likely to appear in transcriptions produced by automatic speech recognition (ASR). The research also examines the prospects of unsupervised adaptation based solely on ASR transcriptions. To achieve better quality, a new method is introduced for selecting an optimal subset of the ASR-transcribed adaptation data. The quality of the method was also evaluated. The results show that adaptation was successful even at phoneme error rates above 50%.
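The abstract does not define how the phoneme error rate is computed. As a minimal sketch, assuming the conventional edit-distance definition (substitutions, deletions, and insertions divided by the number of reference phonemes), it could be calculated as follows. The function name `phoneme_error_rate` and the phoneme symbols are hypothetical and serve only as illustration; they are not taken from the paper.

```python
def phoneme_error_rate(reference, hypothesis):
    """Assumed definition: (substitutions + deletions + insertions) / len(reference).

    `reference` is the correct phoneme sequence and `hypothesis` is the
    ASR-produced one; both are lists of phoneme symbols.
    """
    if not reference:
        raise ValueError("reference must contain at least one phoneme")

    # prev[j] is the edit distance between the reference prefix processed
    # so far and hypothesis[:j]; start with the empty reference prefix.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_ph in enumerate(reference, start=1):
        curr = [i]  # distance from reference[:i] to an empty hypothesis
        for j, hyp_ph in enumerate(hypothesis, start=1):
            cost = 0 if ref_ph == hyp_ph else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference phoneme missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis phoneme)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(reference)


# Illustrative symbols only: one substitution (p -> b) and one deletion (ch).
per = phoneme_error_rate(["s", "p", "iy", "ch"], ["s", "b", "iy"])
print(per)  # 0.5
```

Under this definition the rate can exceed 100% when the hypothesis contains many insertions, so a value above 50% means that the number of edits is more than half the number of reference phonemes.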
Bibliographic reference. Tóth, Bálint / Fegyó, Tibor / Németh, Géza (2011): "The effects of phoneme errors in speaker adaptation for HMM speech synthesis", In INTERSPEECH-2011, 2805-2808.