This paper describes a frequency-domain method of extracting the fundamental frequency of voiced speech which has been band-limited to 300 Hz to 3. 4 KHz. The method uses a linear auditory model into which non-linearity has been introduced. Two methods for introducing the non-linearity into the model are described. Harmonic product spectra are derived from the outputs of the linear and non-linear auditory models. Results show that the spectrum derived from the output of the non-linear auditory model is superior to that obtained from the output of the linear model. Keywords: auditory modelling, speech processing, pitch extraction.
Cite as: Jones, E., Ambikairajah, E. (1991) A perceptually-based pitch extractor for band-limited speech. Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), 449-452, doi: 10.21437/Eurospeech.1991-113
@inproceedings{jones91_eurospeech, author={Edward Jones and Eliathamby Ambikairajah}, title={{A perceptually-based pitch extractor for band-limited speech}}, year=1991, booktitle={Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991)}, pages={449--452}, doi={10.21437/Eurospeech.1991-113} }