Recently, we demonstrated that vocal tract length normalization (VTLN) can be applied to voice conversion tasks. In particular, when the conversion algorithm is performed in the time domain, this technique is very resource-efficient and, consequently, suitable for embedded applications. In this paper, we use VTLN-based voice conversion as a novel feature of a small-footprint speech synthesizer running on mobile devices. The characteristics of this feature are investigated by means of extensive subjective tests.
Cite as: Sundermann, D., Strecha, G., Bonafonte, A., Höge, H., Ney, H. (2005) Evaluation of VTLN-based voice conversion for embedded speech synthesis. Proc. Interspeech 2005, 2593-2596, doi: 10.21437/Interspeech.2005-803
@inproceedings{sundermann05_interspeech,
  author={David Sundermann and Guntram Strecha and Antonio Bonafonte and Harald Höge and Hermann Ney},
  title={{Evaluation of VTLN-based voice conversion for embedded speech synthesis}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2593--2596},
  doi={10.21437/Interspeech.2005-803}
}