This paper describes pitch extraction from telephone-bandwidth speech using a combined place-temporal approach. The method uses a nonlinear auditory model to regenerate the harmonics below 300 Hz that the band-limiting effect of a telephone channel removes from the speech. The spectral representation produced by the nonlinear auditory model is then processed with the Harmonic Product Spectrum to give an initial estimate of the fundamental frequency from place information alone. This initial estimate is refined by temporal analysis of the detailed time response of a single section of the auditory model.
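The two-stage idea (a coarse place estimate followed by a temporal refinement) can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the auditory model is omitted, so the test frame is synthesized with its low harmonics already present, as they would be after the regeneration stage; the coarse estimate comes from a Harmonic Product Spectrum computed on a plain DFT magnitude spectrum; and the refinement uses a time-domain autocorrelation as a stand-in for the temporal analysis of an auditory-model channel. All function names, the 8 kHz sampling rate, the frame length, and the search limits are hypothetical choices made for illustration.

```python
import cmath
import math


def dft_magnitudes(x):
    """DFT magnitudes for bins 0..N/2-1 (naive O(N^2), adequate for short frames)."""
    n = len(x)
    mags = []
    for k in range(n // 2):
        w = -2j * math.pi * k / n
        mags.append(abs(sum(x[t] * cmath.exp(w * t) for t in range(n))))
    return mags


def hps_estimate(x, fs, n_harmonics=3, fmin=50.0, fmax=500.0):
    """Coarse F0 from the Harmonic Product Spectrum (place information only)."""
    mags = dft_magnitudes(x)
    bin_hz = fs / len(x)
    k_lo = max(1, math.ceil(fmin / bin_hz))
    # Keep the highest harmonic bin inside the computed spectrum.
    k_hi = min(int(fmax / bin_hz), (len(mags) - 1) // n_harmonics)

    def product(k):
        return math.prod(mags[r * k] for r in range(1, n_harmonics + 1))

    return max(range(k_lo, k_hi + 1), key=product) * bin_hz


def refine_by_autocorrelation(x, fs, f0_coarse, tolerance=0.1):
    """Refine F0 by searching lags near the coarse period (temporal information).

    Assumption: autocorrelation stands in for the paper's temporal analysis.
    """
    lag0 = fs / f0_coarse
    lo = max(1, int(lag0 * (1 - tolerance)))
    hi = min(len(x) - 1, math.ceil(lag0 * (1 + tolerance)))

    def acf(lag):
        terms = len(x) - lag
        return sum(x[t] * x[t + lag] for t in range(terms)) / terms

    return fs / max(range(lo, hi + 1), key=acf)


def harmonic_frame(f0, fs, n_samples, n_partials=5):
    """Synthetic voiced frame: equal-amplitude partials at f0, 2*f0, ..."""
    return [
        sum(math.sin(2 * math.pi * h * f0 * t / fs) for h in range(1, n_partials + 1))
        for t in range(n_samples)
    ]


fs = 8000  # hypothetical telephone-style sampling rate
frame = harmonic_frame(200.0, fs, 400)  # 50 ms frame, 20 Hz bin spacing
coarse = hps_estimate(frame, fs)
refined = refine_by_autocorrelation(frame, fs, coarse)
print(coarse, refined)
```

In this sketch the coarse estimate is quantized to the DFT bin spacing (20 Hz here), while the refinement is quantized to integer sample lags, which gives finer resolution at low fundamental frequencies. If the fundamental were absent from the spectrum, the product at the true fundamental bin would collapse toward zero, which is why regenerating the missing low harmonics matters before the Harmonic Product Spectrum is applied.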
Bibliographic reference. Jones, E. / Ambikairajah, E. (1995): "Pitch extraction of telephone bandwidth speech using a place-temporal approach", In EUROSPEECH-1995, 123-126.