On the use of phone duration and segmental processing to label speech signal

Depambour, Philippe; Andre-Obrecht, Regine; Delyon, Bernard

doi:10.21437/Eurospeech.1997-465

On the use of phone duration and segmental processing to label speech signal

Philippe Depambour, Regine Andre-Obrecht, Bernard Delyon

This paper presents recent work on continuous speech labelling. We propose an original automatic labelling system where elementary phone models take a segmental analysis and the phone duration into account. These models are initialized by a short speaker-independent training stage in order to constitute a model database. From the standard phonetic transcription, phonological rules are gathered to process the various pronunciations. For each new corpus or speaker, a new quick unsupervised adaptation stage is performed to re-estimate the models, and then follows the correct labelling. We assess this system by labelling a difficult corpus (sequences of connected spelled letter) and sentences of one speaker of the BREF80 corpus. These results are quite promising, in the two experiments less than 9% of phonetic boundaries are incorrectly located.

doi: 10.21437/Eurospeech.1997-465

Cite as: Depambour, P., Andre-Obrecht, R., Delyon, B. (1997) On the use of phone duration and segmental processing to label speech signal. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1627-1630, doi: 10.21437/Eurospeech.1997-465

@inproceedings{depambour97_eurospeech,
  author={Philippe Depambour and Regine Andre-Obrecht and Bernard Delyon},
  title={{On the use of phone duration and segmental processing to label speech signal}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1627--1630},
  doi={10.21437/Eurospeech.1997-465}
}