Abstract
When a smart device talks, what should its voice sound like? Voice-enabled devices are becoming a ubiquitous presence in our everyday lives. Simultaneously, speech synthesis technology is rapidly improving, making it possible to generate increasingly varied and realistic computerized voices. Despite the flexibility and richness of expression that technology now affords, today's most common voice assistants often have female-sounding, polite, and playful voices by default. In this paper, we examine the social consequences of voice design, and introduce a simple research framework for understanding how voice affects how we perceive and interact with smart devices. Based on the foundational paradigm of computers as social actors, and informed by research in human-robot interaction, this framework demonstrates how voice design depends on a complex interplay between characteristics of the user, device, and context. Through this framework, we propose a set of guiding questions to inform future research in the space of voice design for smart devices.
- Nalini Ambady and Robert Rosenthal. 1993. Half a minute: Predicting teacher evaluations from thin slices of nonverbal behavior and physical attractiveness. Journal of personality and social psychology , Vol. 64, 3 (1993), 431.Google ScholarCross Ref
- Tawfiq Ammari, Jofish Kaye, Janice Y. Tsai, and Frank Bentley. 2019. Music, Search, and IoT: How People (Really) Use Voice Assistants . ACM Trans. Comput.-Hum. Interact. , Vol. 26, 3 (April 2019), 17:1--17:28. https://doi.org/10.1145/3311956Google ScholarDigital Library
- Guozhen An, Sarah Ita Levitan, Julia Hirschberg, and Rivka Levitan. 2018. Deep Personality Recognition for Deception Detection. In Proc. Interspeech 2018 . 421--425. https://doi.org/10.21437/Interspeech.2018--2269Google ScholarCross Ref
- Sean Andrist, Micheline Ziadee, Halim Boukaram, Bilge Mutlu, and Majd Sakr. 2015. Effects of Culture on the Credibility of Robot Speech: A Comparison between English and Arabic. In Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction - HRI '15 . ACM Press, Portland, Oregon, USA, 157--164. https://doi.org/10.1145/2696454.2696464Google ScholarDigital Library
- Matthew P. Aylett, Benjamin R. Cowan, and Leigh Clark. 2019. Siri, Echo and Performance: You Have to Suffer Darling. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (CHI EA '19). ACM, New York, NY, USA, alt08:1--alt08:10. https://doi.org/10.1145/3290607.3310422 event-place: Glasgow, Scotland Uk.Google ScholarDigital Library
- Christoph Bartneck, Kumar Yogeeswaran, Qi Min Ser, Graeme Woodward, Robert Sparrow, Siheng Wang, and Friederike Eyssel. 2018. Robots And Racism. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction (HRI '18). ACM, New York, NY, USA, 196--204. https://doi.org/10.1145/3171221.3171260 event-place: Chicago, IL, USA.Google ScholarDigital Library
- Erin Beneteau, Olivia K. Richards, Mingrui Zhang, Julie A. Kientz, Jason Yip, and Alexis Hiniker. 2019. Communication Breakdowns Between Families and Alexa. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). ACM, New York, NY, USA, 243:1--243:13. https://doi.org/10.1145/3290605.3300473 event-place: Glasgow, Scotland Uk.Google ScholarDigital Library
- Meera M. Blattner, Denise A. Sumikawa, and Robert M. Greenberg. 1989. Earcons and Icons: Their Structure and Common Design Principles (Abstract Only). SIGCHI Bull. , Vol. 21, 1 (Aug. 1989), 123--124. https://doi.org/10.1145/67880.1046599Google ScholarDigital Library
- Dieter Bohn. 2019. Amazon says 100 million Alexa devices have been sold. https://www.theverge.com/2019/1/4/18168565/amazon-alexa-devices-how-many-sold-number-100-million-dave-limpGoogle Scholar
- Lera Boroditsky, Lauren A Schmidt, and Webb Phillips. 2003. Sex, syntax, and semantics. Language in mind: Advances in the study of language and thought (2003), 61--79.Google Scholar
- Robin N. Brewer, Leah Findlater, Joseph 'Jofish' Kaye, Walter Lasecki, Cosmin Munteanu, and Astrid Weber. 2018. Accessible Voice Interfaces. In Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '18). ACM, New York, NY, USA, 441--446. https://doi.org/10.1145/3272973.3273006 event-place: Jersey City, NJ, USA.Google ScholarDigital Library
- Minsuk Chang, Anh Truong, Oliver Wang, Maneesh Agrawala, and Juho Kim. 2019. How to Design Voice Based Navigation for How-To Videos . Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems - CHI '19 (2019), 11.Google ScholarDigital Library
- Rebecca Cherng-Shiow Chang, Hsi-Peng Lu, and Peishan Yang. 2018. Stereotypes or golden rules? Exploring likable voice traits of social robots as active aging companions for tech-savvy baby boomers in Taiwan . Computers in Human Behavior , Vol. 84 (July 2018), 194--210. https://doi.org/10.1016/j.chb.2018.02.025Google Scholar
- Brian X. Chen. 2019. Devices That Will Invade Your Life in 2019 (and What's Overhyped) .Google Scholar
- Leigh Clark, Phillip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, and Benjamin Cowan. 2018. The State of Speech in HCI: Trends, Themes and Challenges . arXiv preprint arXiv:1810.06828 (2018).Google Scholar
- Leigh Clark, Abdulmalik Ofemile, Svenja Adolphs, and Tom Rodden. 2016. A Multimodal Approach to Assessing User Experiences with Agent Helpers . ACM Trans. Interact. Intell. Syst. , Vol. 6, 4 (Nov. 2016), 29:1--29:31. https://doi.org/10.1145/2983926Google ScholarDigital Library
- Phil Cohen, Adam Cheyer, Eric Horvitz, Rana El Kaliouby, and Steve Whittaker. 2016. On the Future of Personal Assistants . Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems - CHI EA '16 (2016), 1032--1037. https://doi.org/10.1145/2851581.2886425Google ScholarDigital Library
- Dan Kedmey. 2015. Microsoft's Cortana Gets a Crash Course in Cultural Sensitivity textbar Time . Time Magazine (July 2015). http://time.com/3960670/windows-10-cortana/Google Scholar
- Andreea Danielescu and Gwen Christian. 2018. A Bot is Not a Polyglot . Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems - CHI '18 (2018), 1--9. https://doi.org/10.1145/3170427.3174366Google ScholarDigital Library
- Guy Deutscher. 2010. Through the language glass: Why the world looks different in other languages .Metropolitan Books.Google Scholar
- W. Keith Edwards and Elizabeth D. Mynatt. 1994. An Architecture for Transforming Graphical Interfaces. In Proceedings of the 7th Annual ACM Symposium on User Interface Software and Technology (UIST '94). ACM, New York, NY, USA, 39--47. https://doi.org/10.1145/192426.192443 event-place: Marina del Rey, California, USA.Google Scholar
- Kerstin Fischer, Katrin S Lohan, and Kilian Foth. 2012. Levels of embodiment: Linguistic analyses of factors influencing HRI. In Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction. ACM, 463--470.Google ScholarDigital Library
- BJ Fogg, Gregory Cuellar, and David Danielson. 2019. Motivating, influencing, and persuading users: An introduction to captology. (2019).Google Scholar
- William W Gaver. 1989. The SonicFinder: An interface that uses auditory icons. Human--Computer Interaction , Vol. 4, 1 (1989), 67--94.Google ScholarDigital Library
- Erving Goffman. 1978. The presentation of self in everyday life .Harmondsworth London.Google Scholar
- Rajat Hebbar, Krishna Somandepalli, and Shrikanth Narayanan. 2018. Improving Gender Identification in Movie Audio Using Cross-Domain Data. In Proc. Interspeech 2018. 282--286. https://doi.org/10.21437/Interspeech.2018--1462Google ScholarCross Ref
- Laura Hoffmann, Nikolai Bock, and Astrid M. Rosenthal v.d. Pütten. 2018. The Peculiarities of Robot Embodiment (EmCorp-Scale): Development, Validation and Initial Test of the Embodiment and Corporeality of Artificial Agents Scale. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction (HRI '18). ACM, New York, NY, USA, 370--378. https://doi.org/10.1145/3171221.3171242 event-place: Chicago, IL, USA.Google Scholar
- Miwa Ikemiya and Daniela K. Rosner. 2014. Broken Probes: Toward the Design of Worn Media . Personal Ubiquitous Comput. , Vol. 18, 3 (March 2014), 671--683. https://doi.org/10.1007/s00779-013-0690-yGoogle ScholarDigital Library
- James Vincent. 2018. Google launches more realistic text-to-speech service powered by DeepMind's AI - The Verge . https://www.theverge.com/2018/3/27/17167200/google-ai-speech-tts-cloud-deepmind-wavenetGoogle Scholar
- Eun Hwa Jung, T. Franklin Waddell, and S. Shyam Sundar. 2016. Feminizing Robots: User Responses to Gender Cues on Robot Body and Screen. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems - CHI EA '16. ACM Press, Santa Clara, California, USA, 3107--3113. https://doi.org/10.1145/2851581.2892428Google ScholarDigital Library
- Alisa Kalegina, Grace Schroeder, Aidan Allchin, Keara Berlin, and Maya Cakmak. 2018. Characterizing the Design Space of Rendered Robot Faces. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction - HRI '18. ACM Press, Chicago, IL, USA, 96--104. https://doi.org/10.1145/3171221.3171286Google ScholarDigital Library
- Os Keyes. 2018. The Misgendering Machines: Trans/HCI Implications of Automatic Gender Recognition . Proc. ACM Hum.-Comput. Interact. , Vol. 2, CSCW (Nov. 2018), 88:1--88:22. https://doi.org/10.1145/3274357Google ScholarDigital Library
- Sara Kiesler, Aaron Powers, Susan R Fussell, and Cristen Torrey. 2008. Anthropomorphic interactions with a robot and robot--like agent. Social Cognition , Vol. 26, 2 (2008), 169--181.Google ScholarCross Ref
- Sei Jin Ko, Charles M. Judd, and Irene V. Blair. 2006. What the Voice Reveals: Within- and Between-Category Stereotyping on the Basis of Voice . Personality and Social Psychology Bulletin , Vol. 32, 6 (2006), 806--819. https://doi.org/10.1177/0146167206286627Google ScholarCross Ref
- Rafal Kocielnik, Daniel Avrahami, Jennifer Marlow, Di Lu, and Gary Hsieh. 2018. Designing for Workplace Reflection: A Chat and Voice-Based Conversational Agent . Proceedings of the 2018 Designing Interactive Systems Conference (2018), 881--894. https://doi.org/10.1145/3196709.3196784Google ScholarDigital Library
- Josephine Lau, Benjamin Zimmerman, and Florian Schaub. 2018. Alexa, Are You Listening?: Privacy Perceptions, Concerns and Privacy-seeking Behaviors with Smart Speakers . Proc. ACM Hum.-Comput. Interact. , Vol. 2, CSCW (Nov. 2018), 102:1--102:31. https://doi.org/10.1145/3274371Google ScholarDigital Library
- Lauren Goode. 2018. How Google's Eerie Robot Phone Calls Hint at AI's Future . Wired (May 2018). https://www.wired.com/story/google-duplex-phone-calls-ai-future/Google Scholar
- Kwan Min Lee, Wei Peng, Seung-A Jin, and Chang Yan. 2006. Can Robots Manifest Personality?: An Empirical Test of Personality Recognition, Social Responses, and Social Presence in Human--Robot Interaction . Journal of Communication , Vol. 56, 4 (2006), 754--772. https://doi.org/10.1111/j.1460--2466.2006.00318.xGoogle ScholarCross Ref
- M. K. Lee, S. Kiesler, J. Forlizzi, S. Srinivasa, and P. Rybski. 2010. Gracefully mitigating breakdowns in robotic services. In 2010 5th ACM/IEEE International Conference on Human-Robot Interaction (HRI) . 203--210. https://doi.org/10.1109/HRI.2010.5453195Google Scholar
- Lily Hay Newman. 2014. This Social Robot Is Adorable. But Will Families Actually Want One? Slate (July 2014). https://slate.com/technology/2014/07/social-robotics-expert-cynthia-breazeal-debuts-jibo-a-family-robot.htmlGoogle Scholar
- Nichola Lubold, Erin Walker, and Heather Pon-Barry. 2016. Effects of voice-adaptation and social dialogue on perceptions of a robotic learning companion. In 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI). IEEE, 255--262.Google ScholarCross Ref
- Nichola Lubold, Erin Walker, Heather Pon-Barry, and Amy Ogan. 2018. Automated pitch convergence improves learning in a social, teachable robot for middle school mathematics. In International Conference on Artificial Intelligence in Education. Springer, 282--296.Google ScholarCross Ref
- Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents . Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI '16 (2016), 5286--5297. https://doi.org/10.1145/2858036.2858288Google ScholarDigital Library
- Nikolas Martelaro and Wendy Ju. 2017. WoZ Way: Enabling Real-time Remote Interaction Prototyping & Observation in On-road Vehicles. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing - CSCW '17. ACM Press, Portland, Oregon, USA, 169--182. https://doi.org/10.1145/2998181.2998293Google Scholar
- Matt Simon. 2019. The Genderless Digital Voice the World Needs Right Now . https://www.wired.com/story/the-genderless-digital-voice-the-world-needs-right-now/Google Scholar
- Phil McAleer, Alexander Todorov, and Pascal Belin. 2014. How Do You Say 'Hello'? Personality Impressions from Brief Novel Voices . PLoS ONE , Vol. 9, 3 (March 2014), e90779. https://doi.org/10.1371/journal.pone.0090779Google ScholarCross Ref
- C. McGinn and I. Torre. 2019. Can you Tell the Robot by the Voice? An Exploratory Study on the Role of Voice in the Perception of Robots. In 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI) . 211--221. https://doi.org/10.1109/HRI.2019.8673305Google Scholar
- Moira McGregor and John C. Tang. 2017. More to Meetings: Challenges in Using Speech-Based Technology to Support Meetings. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17). ACM, New York, NY, USA, 2208--2220. https://doi.org/10.1145/2998181.2998335 event-place: Portland, Oregon, USA.Google Scholar
- Michal Luria, Samantha Reig, Xiang Zhi Tan, Aaron Steinfeld, Jodi Forlizzi, and John Zimmerman. [n. d.]. Re-Embodiment and Co-Embodiment: Exploration of Social Presence for Robots and Conversational Agents. In Proceedings of the 2018 on Designing Interactive Systems Conference 2019 - DIS '19 .Google Scholar
- Rani Molla. 2018. Voice tech like Alexa and Siri hasn't found its true calling yet: Inside the voice assistant 'revolution'. Recode (2018). https://www.recode.net/2018/11/12/17765390/voice-alexa-siri-assistant-amazon-echo-google-assistantGoogle Scholar
- Juan Manuel Montero, Juana M Gutierrez-Arriola, Sira Palazuelos, Emilia Enriquez, Santiago Aguilera, and José Manuel Pardo. 1998. Emotional speech synthesis: From speech database to TTS. In Fifth International Conference on Spoken Language Processing.Google Scholar
- Dylan Moore, Hamish Tennent, Nikolas Martelaro, and Wendy Ju. 2017. Making noise intentional: A study of servo sound perception. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction. ACM, 12--21.Google ScholarDigital Library
- Roger K Moore. 2017a. Appropriate Voices for Artefacts: Some Key Insights. In 1st International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots.Google Scholar
- Roger K. Moore. 2017b. Is Spoken Language All-or-Nothing? Implications for Future Speech-Based Human-Machine Interaction . In Dialogues with Social Robots: Enablements, Analyses, and Evaluation , , Kristiina Jokinen and Graham Wilcock (Eds.). Springer Singapore, Singapore, 281--291. https://doi.org/10.1007/978--981--10--2585--3_22Google Scholar
- Christine Murad, Cosmin Munteanu, Leigh Clark, and Benjamin R Cowan. 2018. Design Guidelines for Hands-free Speech Interaction. In Proceedings of the 20th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct (MobileHCI '18). ACM, New York, NY, USA, 269--276. https://doi.org/10.1145/3236112.3236149Google ScholarDigital Library
- Bilge Mutlu, Steven Osman, Jodi Forlizzi, Jessica Hodgins, and Sara Kiesler. 2006. Task Structure and User Attributes as Elements of Human-Robot Interaction Design. In ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication. IEEE, Univ. of Hertfordshire, Hatfield, UK, 74--79. https://doi.org/10.1109/ROMAN.2006.314397Google ScholarCross Ref
- Chelsea Myers, Anushay Furqan, Jessica Nebolsky, Karina Caro, and Jichen Zhu. 2018. Patterns for How Users Overcome Obstacles in Voice User Interfaces. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18). ACM, New York, NY, USA, 6:1--6:7. https://doi.org/10.1145/3173574.3173580 event-place: Montreal QC, Canada.Google ScholarDigital Library
- Clifford Nass and Scott Brave. 2005. Wired for speech: How voice activates and advances the human-computer relationship. MIT press.Google ScholarDigital Library
- Clifford Nass and Kwan Min Lee. 2001. Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction. Journal of experimental psychology: applied , Vol. 7, 3 (2001), 171.Google ScholarCross Ref
- Clifford Nass, Jonathan Steuer, and Ellen R. Tauber. 1994. Computers Are Social Actors. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '94). ACM, New York, NY, USA, 72--78. https://doi.org/10.1145/191666.191703 event-place: Boston, Massachusetts, USA.Google ScholarDigital Library
- Kenneth Olmstead. 2017. Nearly half of Americans use digital voice assistants, mostly on their smartphones . Technical Report. Pew Research Center. https://www.pewresearch.org/fact-tank/2017/12/12/nearly-half-of-americans-use-digital-voice-assistants-mostly-on-their-smartphones/Google Scholar
- Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, and Koray Kavukcuoglu. 2016. Wavenet: A generative model for raw audio. arXiv preprint arXiv:1609.03499 (2016).Google Scholar
- Sarah Perez. 2018. Alexa developers get 8 free voices to use in skills, courtesy of Amazon Polly . TechCrunch (May 2018). https://techcrunch.com/2018/05/16/alexa-developers-get-8-free-voices-to-use-in-skills-courtesy-of-amazon-polly/Google Scholar
- Martha L Picariello, Danna N Greenberg, and David B Pillemer. 1990. Children's sex-related stereotyping of colors. Child Development , Vol. 61, 5 (1990), 1453--1460.Google ScholarCross Ref
- Martin Porcheron, Joel E Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice Interfaces in Everyday Life . Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems - CHI '18 (2018), 1--12. https://doi.org/doi.org/10.1145/3173574.3174214Google ScholarDigital Library
- Martin Porcheron, Joel E. Fischer, and Sarah Sharples. 2017. "Do Animals Have Accents?": Talking with Agents in Multi-Party Conversation. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17). ACM, New York, NY, USA, 207--219. https://doi.org/10.1145/2998181.2998298 event-place: Portland, Oregon, USA.Google ScholarDigital Library
- A Purington, J G Taft, S Sannon, N N Bazarova, and S H Taylor. 2017. "Alexa is my new BFF": Social roles, user satisfaction, and personification of the Amazon Echo . Conference on Human Factors in Computing Systems - Proceedings , Vol. Part F1276 (2017), 2853--2859. https://doi.org/10.1145/3027063.3053246Google ScholarDigital Library
- Sara Perez. 2019. Report: Voice assistants in use to triple to 8 billion by 2023 textbar TechCrunch . https://techcrunch.com/2019/02/12/report-voice-assistants-in-use-to-triple-to-8-billion-by-2023/Google Scholar
- Alex Sciuto, Arnita Saini, Jodi Forlizzi, and Jason I Hong. 2018. "Hey Alexa, What's Up?": A Mixed-Methods Studies of In-Home Conversational Agent Usage . Proceedings of the 2018 on Designing Interactive Systems Conference 2018 - DIS '18 (2018), 857--868. https://doi.org/10.1145/3196709.3196772Google ScholarDigital Library
- Selina Jeanne Sutton , Paul Foulkes, David Kirk, and Shaun Lawson. 2019. Voice as a Design Material: Sociophonetic Inspired Design Strategies in Human-Computer Interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems - CHI '19 . 1--14. https://doi.org/10.1145/3290605.3300833Google ScholarDigital Library
- Ben Shneiderman. 2000. The limits of speech recognition. Commun. ACM , Vol. 43, 9 (2000), 63--65. https://doi.org/10.1145/348941.348990Google ScholarDigital Library
- Berrak Sisman and Haizhou Li. 2018. Wavelet Analysis of Speaker Dependent and Independent Prosody for Voice Conversion. In Proc. Interspeech 2018 . 52--56. https://doi.org/10.21437/Interspeech.2018--1499Google ScholarCross Ref
- Aaron Springer and Henriette Cramer. 2018. 'Play PRBLMS": Identifying and Correcting Less Accessible Content in Voice Interfaces . (2018), 1--13. https://doi.org/10.1145/3173574.3173870Google Scholar
- Vasant Srinivasan and Leila Takayama. 2016. Help Me Please: Robot Politeness Strategies for Soliciting Help From Humans. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). ACM, New York, NY, USA, 4945--4955. https://doi.org/10.1145/2858036.2858217 event-place: San Jose, California, USA.Google ScholarDigital Library
- Chandra Steele. 2018. The Real Reason Voice Assistants Are Female (and Why it Matters). PCMag (2018).Google Scholar
- Marie Louise Juul Søndergaard and Lone Koefoed Hansen. 2018. Intimate Futures: Staying with the Trouble of Digital Personal Assistants through Design Fiction . Proceedings of the 2018 on Designing Interactive Systems Conference 2018 - DIS '18 (2018), 869--880. https://doi.org/10.1145/3196709.3196766Google ScholarDigital Library
- Benedict Tay, Younbo Jung, and Taezoon Park. 2014. When stereotypes meet robots: The double-edge sword of robot gender and personality in human--robot interaction. Computers in Human Behavior , Vol. 38 (Sept. 2014), 75--84. https://doi.org/10.1016/j.chb.2014.05.014Google ScholarDigital Library
- Hamish Tennent, Dylan Moore, Malte Jung, and Wendy Ju. 2017. Good vibrations: How consequential sounds affect perception of robotic arms. In 2017 26th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN). IEEE, 928--935.Google ScholarDigital Library
- Cristen Torrey, Susan Fussell, and Sara Kiesler. 2013. How a Robot Should Give Advice. In Proceedings of the 8th ACM/IEEE International Conference on Human-robot Interaction (HRI '13). IEEE Press, Piscataway, NJ, USA, 275--282. http://dl.acm.org/citation.cfm?id=2447556.2447666 event-place: Tokyo, Japan.Google ScholarDigital Library
- James Vincent. 2019. Kohler's smart toilet promises a 'fully-immersive experience'. The Verge (2019). https://www.theverge.com/2019/1/6/18170575/kohler-konnect-bathroom-smart-gadgets-numi-intelligent-toilet-ces-2019Google Scholar
- Mark West, Rebecca Kraut, and Han Ei Chew. 2019. I'd blush if I could: closing gender divides in digital skills through education . Technical Report. UNESCO, EQUALS Skills Coalition. https://unesdoc.unesco.org/ark:/48223/pf0000367416.locale=enGoogle Scholar
- Mirjam Wester, Cassia Valentini-Botinhao, and Gustav Eje Henter. 2015. Are We Using Enough Listeners? No!-An Empirically-Supported Critique of Interspeech 2014 TTS Evaluations. In Sixteenth Annual Conference of the International Speech Communication Association.Google ScholarCross Ref
Index Terms
- One Voice Fits All?: Social Implications and Research Challenges of Designing Voices for Smart Devices
Recommendations
Female by Default? – Exploring the Effect of Voice Assistant Gender and Pitch on Trait and Trust Attribution
CHI EA '21: Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing SystemsGendered voice based on pitch is a prevalent design element in many contemporary Voice Assistants (VAs) but has shown to strengthen harmful stereotypes. Interestingly, there is a dearth of research that systematically analyses user perceptions of ...
KinVoices: Using Voices of Friends and Family in Voice Interfaces
CSCW2With voice user interfaces (VUIs) becoming ubiquitous and speech synthesis technology maturing, it is possible to synthesise voices to resemble our friends and relatives (which we will collectively call 'kin') and use them on VUIs. However, designing ...
Exploring Humor as a Repair Strategy During Communication Breakdowns with Voice Assistants
CUI '23: Proceedings of the 5th International Conference on Conversational User InterfacesVoice assistants are becoming increasingly useful and support realistic conversations, yet communication breakdowns occur. We investigate the use of humor as a repair strategy in an experiment where the voice assistant makes a mistake and then utilizes ...
Comments