
Prosodic Parameters of Emotional Synthetic Speech in Czech: Perception Validation

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2011)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7015)


Abstract

This study examines how basic prosodic parameters influence the perception and identification of emotions. Emotional sentences were generated by a TTS system with high naturalness of speech; pitch contour, intensity, and duration, as well as their combinations, were modified by hand. The prosody of the sentences was modified to express four emotions (anger, fear, joy, and boredom) for both male and female voices. The sentences with emotions modelled by means of prosody were then presented in listening tests, which revealed the importance of different parameters for the identification of different emotions. The results show that the identification of different emotions relies on relevant changes in different parameters and their combinations.




Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vlčková-Mejvaldová, J., Horák, P. (2011). Prosodic Parameters of Emotional Synthetic Speech in Czech: Perception Validation. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science, vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_22


  • DOI: https://doi.org/10.1007/978-3-642-25020-0_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25019-4

  • Online ISBN: 978-3-642-25020-0

  • eBook Packages: Computer Science, Computer Science (R0)
