Segmentation of speech into syllable-like units

Nagarajan, T.; Murthy, Hema A.; Hegde, Rajesh M.

doi:10.21437/Eurospeech.2003-48

Segmentation of speech into syllable-like units

T. Nagarajan, Hema A. Murthy, Rajesh M. Hegde

In the development of a syllable-centric ASR system, segmentation of the acoustic signal into syllabic units is an important stage. This paper presents a minimum phase group delay based approach to segment spontaneous speech into syllable-like units. Here, three different minimum phase signals are derived from the short term energy functions of three sub-bands of speech signals, as if it were a magnitude spectrum. The experiments are carried out on Switchboard and OGI-MLTS corpus and the error in segmentation is found to be utmost 40msec for 85% of the syllable segments.

doi: 10.21437/Eurospeech.2003-48

Cite as: Nagarajan, T., Murthy, H.A., Hegde, R.M. (2003) Segmentation of speech into syllable-like units. Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 2893-2896, doi: 10.21437/Eurospeech.2003-48

@inproceedings{nagarajan03_eurospeech,
  author={T. Nagarajan and Hema A. Murthy and Rajesh M. Hegde},
  title={{Segmentation of speech into syllable-like units}},
  year=2003,
  booktitle={Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003)},
  pages={2893--2896},
  doi={10.21437/Eurospeech.2003-48}
}