Evaluation of VTLN-based voice conversion for embedded speech synthesis

Sundermann, David; Strecha, Guntram; Bonafonte, Antonio; Höge, Harald; Ney, Hermann

doi:10.21437/Interspeech.2005-803

Recently, we demonstrated that vocal tract length normalization (VTLN) can be applied to voice conversion tasks. In particular, when the conversion is performed in the time domain, this technique is very resource-efficient and, consequently, suitable for embedded applications. In this paper, we use VTLN-based voice conversion as a novel feature of a small-footprint speech synthesizer running on mobile devices. The characteristics of this feature are investigated by means of extensive subjective tests.

Cite as: Sundermann, D., Strecha, G., Bonafonte, A., Höge, H., Ney, H. (2005) Evaluation of VTLN-based voice conversion for embedded speech synthesis. Proc. Interspeech 2005, 2593-2596, doi: 10.21437/Interspeech.2005-803

@inproceedings{sundermann05_interspeech,
  author={David Sundermann and Guntram Strecha and Antonio Bonafonte and Harald Höge and Hermann Ney},
  title={{Evaluation of VTLN-based voice conversion for embedded speech synthesis}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2593--2596},
  doi={10.21437/Interspeech.2005-803}
}