In the development of a syllable-centric ASR system, segmentation of the acoustic signal into syllabic units is an important stage. This paper presents a minimum phase group delay based approach to segment spontaneous speech into syllable-like units. Here, three different minimum phase signals are derived from the short term energy functions of three sub-bands of speech signals, as if it were a magnitude spectrum. The experiments are carried out on Switchboard and OGI-MLTS corpus and the error in segmentation is found to be utmost 40msec for 85% of the syllable segments.
Cite as: Nagarajan, T., Murthy, H.A., Hegde, R.M. (2003) Segmentation of speech into syllable-like units. Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 2893-2896, doi: 10.21437/Eurospeech.2003-48
@inproceedings{nagarajan03_eurospeech, author={T. Nagarajan and Hema A. Murthy and Rajesh M. Hegde}, title={{Segmentation of speech into syllable-like units}}, year=2003, booktitle={Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003)}, pages={2893--2896}, doi={10.21437/Eurospeech.2003-48} }