ISCA Archive Interspeech 2017
ISCA Archive Interspeech 2017

Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs

Ana Ramírez López, Shreyas Seshadri, Lauri Juvela, Okko Räsänen, Paavo Alku

Speaking style conversion is the technology of converting natural speech signals from one style to another. In this study, we focus on normal-to-Lombard conversion. This can be used, for example, to enhance the intelligibility of speech in noisy environments. We propose a parametric approach that uses a vocoder to extract speech features. These features are mapped using Bayesian GMMs from utterances spoken in normal style to the corresponding features of Lombard speech. Finally, the mapped features are converted to a Lombard speech waveform with the vocoder. Two vocoders were compared in the proposed normal-to-Lombard conversion: a recently developed glottal vocoder that decomposes speech into glottal flow excitation and vocal tract, and the widely used STRAIGHT vocoder. The conversion quality was evaluated in two subjective listening tests measuring subjective similarity and naturalness. The similarity test results show that the system is able to convert normal speech into Lombard speech for the two vocoders. However, the subjective naturalness of the converted Lombard speech was clearly better using the glottal vocoder in comparison to STRAIGHT.


doi: 10.21437/Interspeech.2017-400

Cite as: López, A.R., Seshadri, S., Juvela, L., Räsänen, O., Alku, P. (2017) Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs. Proc. Interspeech 2017, 1363-1367, doi: 10.21437/Interspeech.2017-400

@inproceedings{lopez17_interspeech,
  author={Ana Ramírez López and Shreyas Seshadri and Lauri Juvela and Okko Räsänen and Paavo Alku},
  title={{Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs}},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={1363--1367},
  doi={10.21437/Interspeech.2017-400},
  issn={2308-457X}
}