4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
This paper describes a approach to speech segmentation. Unlike approaches based on spectral measurements, our algorithm iteratively clusters on an LPC representation of time waveform blocks. The algorithm uses a generalized maximum likelihood criterion for deciding when two neighboring pieces of the signal should be joined. This paper describes the algorithm and shows that it yields superior results when compared to metrics based on spectral or cepstral measurements.
Bibliographic reference. Eberman, Brian / Goldenthal, William (1996): "Time-based clustering for phonetic segmentation", In ICSLP-1996, 1225-1228.