The Relationship Between Pauses and Emphasis: Implications for Charismatic Speech Synthesis

  • Conference paper
Human-Computer Interaction (HCII 2023)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14013)

Abstract

An animated voice is engaging, memorable, and entertaining. It is also an important aspect of charismatic behavior. In animated voices, variations in speed, intensity, and intonation play a role in the perception of charisma. Changes in speed give rise to pauses, and speakers, especially charismatic ones, often use the silence created by pauses to contrast with the emphasis that follows, such as words spoken with high intensity. How can we realize such behaviors in a virtual character? In this paper, we discuss our work toward the synthesis of charismatic speech. We collected voice recordings of a tutorial on the human circulatory system, delivered in both charismatic and non-charismatic voices by actors recruited from a crowd-sourcing platform. The recordings were then annotated for occurrences of emphasis. We present an analysis of the pauses and emphasis in the charismatic and non-charismatic recordings, discuss how the two relate to each other, and consider how these findings can inform the synthesis of charismatic speech for virtual characters.

Acknowledgement

This research was supported by the National Science Foundation under Grant #1816966. Any opinions, findings, and conclusions expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Author information

Corresponding author

Correspondence to Ning Wang.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Wang, N., Karpurapu, A., Jajodia, A., Merchant, C. (2023). The Relationship Between Pauses and Emphasis: Implications for Charismatic Speech Synthesis. In: Kurosu, M., Hashizume, A. (eds) Human-Computer Interaction. HCII 2023. Lecture Notes in Computer Science, vol 14013. Springer, Cham. https://doi.org/10.1007/978-3-031-35602-5_29

  • DOI: https://doi.org/10.1007/978-3-031-35602-5_29

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-35601-8

  • Online ISBN: 978-3-031-35602-5

  • eBook Packages: Computer Science, Computer Science (R0)
