ABSTRACT
In the last decade, interest in accessible and eyes-free text entry has continued to grow. However, little research has been done to explore the feasibility of using audibly distinct phrases for text entry tasks. To better understand whether preexisting phrases used in text entry research are sufficiently distinct for eyes-free text entry tasks, we used Microsoft’s and Apple’s desktop text-to-speech systems to generate all 500 phrases from MacKenzie and Soukoreff’s set [32] using the default male and female voices. We then asked 392 participants recruited through Amazon’s Mechanical Turk to transcribe the generated audio clips. We report participant transcription errors and present the 96 phrases that were observed with no comprehension errors. These phrases were further tested with 80 participants who identified as low-vision and/or blind recruited through Twitter. We contribute the 92 phrases that were observed to maintain no comprehension errors across both experiments.
- Ali Abdolrahmani, Ravi Kuber, and Stacy M. Branham. 2018. ”Siri Talks at You”: An Empirical Investigation of Voice-Activated Personal Assistant (VAPA) Usage by Individuals Who Are Blind. In Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (Galway, Ireland) (ASSETS ’18). Association for Computing Machinery, New York, NY, USA, 249–258. https://doi.org/10.1145/3234695.3236344Google ScholarDigital Library
- Maraim Alnfiai and Srinivas Sampalli. 2016. An Evaluation of SingleTapBraille Keyboard: A Text Entry Method That Utilizes Braille Patterns on Touchscreen Devices. In Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility (Reno, Nevada, USA) (ASSETS ’16). Association for Computing Machinery, New York, NY, USA, 161–169. https://doi.org/10.1145/2982142.2982161Google ScholarDigital Library
- Mrim Alnfiai and Srinivas Sampalli. 2017. BrailleEnter: A Touch Screen Braille Text Entry Method for the Blind. Procedia Computer Science 109 (2017), 257 – 264. https://doi.org/10.1016/j.procs.2017.05.349 8th International Conference on Ambient Systems, Networks and Technologies, ANT-2017 and the 7th International Conference on Sustainable Energy Information Technology, SEIT 2017, 16-19 May 2017, Madeira, Portugal.Google ScholarCross Ref
- Gunilla Alsio and Mikael Goldstein. 2000. Productivity prediction by extrapolation: Using workload memory as a predictor of target performance. Behaviour & Information Technology 19, 2 (2000), 87–96. https://doi.org/10.1080/014492900118696 arXiv:https://doi.org/10.1080/014492900118696Google ScholarCross Ref
- Tawfiq Ammari, Jofish Kaye, Janice Y. Tsai, and Frank Bentley. 2019. Music, Search, and IoT: How People (Really) Use Voice Assistants. ACM Trans. Comput.-Hum. Interact. 26, 3, Article 17 (April 2019), 28 pages. https://doi.org/10.1145/3311956Google ScholarDigital Library
- Ahmed Sabbir Arif and Wolfgang Stuerzlinger. 2009. Analysis of text entry performance metrics. In 2009 IEEE Toronto International Conference Science and Technology for Humanity (TIC-STH). IEEE, Toronto, Ontario, Canada, 100–105.Google ScholarCross Ref
- Chieko Asakawa, Hironobu Takagi, Shuichi Ino, and Tohru Ifukube. 2003. Maximum listening speeds for the blind, In Proceedings of the 2003 International Conference on Auditory Display. International Conference on Auditory Display 9, 276–279.Google Scholar
- Shiri Azenkot and Nicole B. Lee. 2013. Exploring the Use of Speech Input by Blind People on Mobile Devices. In Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility(Bellevue, Washington) (ASSETS ’13). Association for Computing Machinery, New York, NY, USA, Article 11, 8 pages. https://doi.org/10.1145/2513383.2513440Google ScholarDigital Library
- Shiri Azenkot, Jacob O Wobbrock, Sanjana Prasain, and Richard E Ladner. 2012. Input finger detection for nonvisual touch screen text entry in Perkinput. In Proceedings of Graphics Interface 2012. Association for Computing Machinery, Toronto, ON, Canada, 121–129.Google Scholar
- Shiri Azenkot and Shumin Zhai. 2012. Touch Behavior with Different Postures on Soft Smartphone Keyboards. In Proceedings of the 14th International Conference on Human-Computer Interaction with Mobile Devices and Services (San Francisco, California, USA) (MobileHCI ’12). Association for Computing Machinery, New York, NY, USA, 251–260. https://doi.org/10.1145/2371574.2371612Google ScholarDigital Library
- Berdasco, López, Diaz, Quesada, and Guerrero. 2019. User Experience Comparison of Intelligent Personal Assistants: Alexa, Google Assistant, Siri and Cortana. Proceedings 31, 1 (Nov 2019), 51. https://doi.org/10.3390/proceedings2019031051Google ScholarCross Ref
- Bhakti Bhikne, Anirudha Joshi, Manjiri Joshi, Charudatta Jadhav, and Prabodh Sakhardande. 2019. Faster and Less Error-Prone: Supplementing an Accessible Keyboard with Speech Input. In Human-Computer Interaction – INTERACT 2019, David Lamas, Fernando Loizides, Lennart Nacke, Helen Petrie, Marco Winckler, and Panayiotis Zaphiris (Eds.). Springer International Publishing, Cham, 288–304.Google ScholarDigital Library
- Matthew N. Bonner, Jeremy T. Brudvik, Gregory D. Abowd, and W. Keith Edwards. 2010. No-Look Notes: Accessible Eyes-Free Multi-touch Text Entry. In Pervasive Computing, Patrik Floréen, Antonio Krüger, and Mirjana Spasojevic (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 409–426.Google ScholarDigital Library
- Julia Cambre, Jessica Colnago, Jim Maddock, Janice Tsai, and Jofish Kaye. 2020. Choice of Voices: A Large-Scale Evaluation of Text-to-Speech Voice Quality for Long-Form Content. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3313831.3376789Google ScholarDigital Library
- Edward Clarkson, James Clawson, Kent Lyons, and Thad Starner. 2005. An Empirical Study of Typing Rates on Mini-QWERTY Keyboards. In CHI ’05 Extended Abstracts on Human Factors in Computing Systems (Portland, OR, USA) (CHI EA ’05). Association for Computing Machinery, New York, NY, USA, 1288–1291. https://doi.org/10.1145/1056808.1056898Google ScholarDigital Library
- James Clawson, Kent Lyons, Thad Starner, and Edward Clarkson. 2005. The impacts of limited visual feedback on mobile text entry for the Twiddler and mini-QWERTY keyboards. In Ninth IEEE International Symposium on Wearable Computers (ISWC’05). IEEE, Osaka, Japan, 170–177.Google ScholarDigital Library
- James Clawson, Thad Starner, Daniel Kohlsdorf, David P. Quigley, and Scott Gilliland. 2014. Texting While Walking: An Evaluation of Mini-Qwerty Text Input While on-the-Go. In Proceedings of the 16th International Conference on Human-Computer Interaction with Mobile Devices & Services (Toronto, ON, Canada) (MobileHCI ’14). Association for Computing Machinery, New York, NY, USA, 339–348. https://doi.org/10.1145/2628363.2628408Google ScholarDigital Library
- David Dearman, Amy Karlson, Brian Meyers, and Ben Bederson. 2010. Multi-Modal Text Entry and Selection on a Mobile Device. In Proceedings of Graphics Interface 2010 (Ottawa, Ontario, Canada) (GI ’10). Canadian Information Processing Society, CAN, 19–26.Google ScholarDigital Library
- Marc Franco-Salvador and Luis A Leiva. 2018. Multilingual phrase sampling for text entry evaluations. International Journal of Human-Computer Studies 113 (2018), 15–31.Google ScholarDigital Library
- Mayank Goel, Leah Findlater, and Jacob Wobbrock. 2012. WalkType: Using Accelerometer Data to Accomodate Situational Impairments in Mobile Touch Screen Text Entry. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Austin, Texas, USA) (CHI ’12). Association for Computing Machinery, New York, NY, USA, 2687–2696. https://doi.org/10.1145/2207676.2208662Google ScholarDigital Library
- Aakar Gupta, Cheng Ji, Hui-Shyong Yeo, Aaron Quigley, and Daniel Vogel. 2019. RotoSwype: Word-Gesture Typing Using a Ring. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3290605.3300244Google ScholarDigital Library
- Juan David Hincapié-Ramos, Xiang Guo, Paymahn Moghadasian, and Pourang Irani. 2014. Consumed Endurance: A Metric to Quantify Arm Fatigue of Mid-Air Interactions. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Toronto, Ontario, Canada) (CHI ’14). Association for Computing Machinery, New York, NY, USA, 1063–1072. https://doi.org/10.1145/2556288.2557130Google ScholarDigital Library
- Eve Hoggan, Stephen A. Brewster, and Jody Johnston. 2008. Investigating the Effectiveness of Tactile Feedback for Mobile Touchscreens. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Florence, Italy) (CHI ’08). Association for Computing Machinery, New York, NY, USA, 1573–1582. https://doi.org/10.1145/1357054.1357300Google ScholarDigital Library
- Per Ola Kristensson, Stephen Brewster, James Clawson, Mark Dunlop, Leah Findlater, Poika Isokoski, Benoît Martin, Antti Oulasvirta, Keith Vertanen, and Annalu Waller. 2013. Grand Challenges in Text Entry. In CHI ’13 Extended Abstracts on Human Factors in Computing Systems (Paris, France) (CHI EA ’13). Association for Computing Machinery, New York, NY, USA, 3315–3318. https://doi.org/10.1145/2468356.2479675Google ScholarDigital Library
- Per-Ola Kristensson and Shumin Zhai. 2004. SHARK2: A Large Vocabulary Shorthand Writing System for Pen-Based Computers. In Proceedings of the 17th Annual ACM Symposium on User Interface Software and Technology (Santa Fe, NM, USA) (UIST ’04). Association for Computing Machinery, New York, NY, USA, 43–52. https://doi.org/10.1145/1029632.1029640Google ScholarDigital Library
- Jennifer Lai, Karen Cheng, Paul Green, and Omer Tsimhoni. 2001. On the Road and on the Web? Comprehension of Synthetic and Human Speech While Driving. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Seattle, Washington, USA) (CHI ’01). Association for Computing Machinery, New York, NY, USA, 206–212. https://doi.org/10.1145/365024.365100Google ScholarDigital Library
- Jennifer Lai, David Wood, and Michael Considine. 2000. The Effect of Task Conditions on the Comprehensibility of Synthetic Speech. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (The Hague, The Netherlands) (CHI ’00). Association for Computing Machinery, New York, NY, USA, 321–328. https://doi.org/10.1145/332040.332451Google ScholarDigital Library
- David YW Lee. 2010. What corpora are available?In The Routledge handbook of corpus linguistics. Routledge, New York, NY, USA, 107–121.Google Scholar
- Vladimir I Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady, Vol. 10.8. Doklady Akademii Nauk SSSR, Soviet Union, 707–710.Google Scholar
- Kent Lyons, Thad Starner, Daniel Plaisted, James Fusia, Amanda Lyons, Aaron Drew, and E. W. Looney. 2004. Twiddler Typing: One-Handed Chording Text Entry for Mobile Phones. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Vienna, Austria) (CHI ’04). Association for Computing Machinery, New York, NY, USA, 671–678. https://doi.org/10.1145/985692.985777Google ScholarDigital Library
- I. Scott MacKenzie and R. William Soukoreff. 2002. Text Entry for Mobile Computing: Models and Methods,Theory and Practice. Human–Computer Interaction 17, 2-3 (2002), 147–198. https://doi.org/10.1080/07370024.2002.9667313 arXiv:https://www.tandfonline.com/doi/pdf/10.1080/07370024.2002.9667313Google ScholarCross Ref
- I. Scott MacKenzie and R. William Soukoreff. 2003. Phrase Sets for Evaluating Text Entry Techniques. In CHI ’03 Extended Abstracts on Human Factors in Computing Systems (Ft. Lauderdale, Florida, USA) (CHI EA ’03). Association for Computing Machinery, New York, NY, USA, 754–755. https://doi.org/10.1145/765891.765971Google ScholarDigital Library
- Edgar Matias, I. Scott MacKenzie, and William Buxton. 1994. Half-QWERTY: Typing with One Hand Using Your Two-Handed Skills. In Conference Companion on Human Factors in Computing Systems (Boston, Massachusetts, USA) (CHI ’94). Association for Computing Machinery, New York, NY, USA, 51–52. https://doi.org/10.1145/259963.260024Google ScholarDigital Library
- Hugo Nicolau, Kyle Montague, Tiago Guerreiro, André Rodrigues, and Vicki L. Hanson. 2015. Typing Performance of Blind Users: An Analysis of Touch Behaviors, Learning Effect, and In-Situ Usage. In Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility (Lisbon, Portugal) (ASSETS ’15). Association for Computing Machinery, New York, NY, USA, 273–280. https://doi.org/10.1145/2700648.2809861Google ScholarDigital Library
- João Oliveira, Tiago Guerreiro, Hugo Nicolau, Joaquim Jorge, and Daniel Gonçalves. 2011. BrailleType: Unleashing Braille over Touch Screen Mobile Phones. In Human-Computer Interaction – INTERACT 2011, Pedro Campos, Nicholas Graham, Joaquim Jorge, Nuno Nunes, Philippe Palanque, and Marco Winckler (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 100–107.Google ScholarCross Ref
- Stephen Oney, Chris Harrison, Amy Ogan, and Jason Wiese. 2013. ZoomBoard: A Diminutive Qwerty Soft Keyboard Using Iterative Zooming for Ultra-Small Devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Paris, France) (CHI ’13). Association for Computing Machinery, New York, NY, USA, 2799–2802. https://doi.org/10.1145/2470654.2481387Google ScholarDigital Library
- Tim Paek and Bo-June (Paul) Hsu. 2011. Sampling Representative Phrase Sets for Text Entry Experiments: A Procedure and Public Resource. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Vancouver, BC, Canada) (CHI ’11). Association for Computing Machinery, New York, NY, USA, 2477–2480. https://doi.org/10.1145/1978942.1979304Google ScholarDigital Library
- David B. Pisoni, Laura M. Manous, and Michael J. Dedina. 1987. Comprehension of natural and synthetic speech: effects of predictability on the verification of sentences controlled for intelligibility. Computer Speech & Language 2, 3 (1987), 303 – 320. https://doi.org/10.1016/0885-2308(87)90014-3Google ScholarCross Ref
- Alisha Pradhan, Kanika Mehta, and Leah Findlater. 2018. ”Accessibility Came by Accident”: Use of Voice-Controlled Intelligent Personal Assistants by People with Disabilities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (Montreal QC, Canada) (CHI ’18). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3173574.3174033Google ScholarDigital Library
- R Core Team. 2019. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/Google Scholar
- André Rodrigues, Kyle Montague, Hugo Nicolau, and Tiago Guerreiro. 2015. Getting Smartphones to Talkback: Understanding the Smartphone Adoption Process of Blind Users. In Proceedings of the 17th International ACM SIGACCESS Conference on Computers &; Accessibility (Lisbon, Portugal) (ASSETS ’15). Association for Computing Machinery, New York, NY, USA, 23–32. https://doi.org/10.1145/2700648.2809842Google ScholarDigital Library
- Dongkwang Shin and Paul Nation. 2008. Beyond single words: The most frequent collocations in spoken English. ELT journal 62, 4 (2008), 339–348.Google ScholarCross Ref
- R. William Soukoreff and I. Scott MacKenzie. 2001. Measuring Errors in Text Entry Tasks: An Application of the Levenshtein String Distance Statistic. In CHI ’01 Extended Abstracts on Human Factors in Computing Systems (Seattle, Washington) (CHI EA ’01). Association for Computing Machinery, New York, NY, USA, 319–320. https://doi.org/10.1145/634067.634256Google ScholarDigital Library
- R. William Soukoreff and I. Scott MacKenzie. 2003. Metrics for Text Entry Research: An Evaluation of MSD and KSPC, and a New Unified Error Metric. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Ft. Lauderdale, Florida, USA) (CHI ’03). Association for Computing Machinery, New York, NY, USA, 113–120. https://doi.org/10.1145/642611.642632Google ScholarDigital Library
- Caleb Southern, James Clawson, Brian Frey, Gregory Abowd, and Mario Romero. 2012. An Evaluation of BrailleTouch: Mobile Touchscreen Text Entry for the Visually Impaired. In Proceedings of the 14th International Conference on Human-Computer Interaction with Mobile Devices and Services (San Francisco, California, USA) (MobileHCI ’12). Association for Computing Machinery, New York, NY, USA, 317–326. https://doi.org/10.1145/2371574.2371623Google ScholarDigital Library
- Toshiyuki Takezawa, Genichiro Kikui, Masahide Mizushima, and Eiichiro Sumita. 2007. Multilingual spoken language corpus development for communication research. In International Journal of Computational Linguistics & Chinese Language Processing, Volume 12, Number 3, September 2007: Special Issue on Invited Papers from ISCSLP 2006. Springer, Signapore, 303–324.Google Scholar
- Hussain Tinwala and I. Scott MacKenzie. 2008. Letterscroll: Text Entry Using a Wheel for Visually Impaired Users. In CHI ’08 Extended Abstracts on Human Factors in Computing Systems (Florence, Italy) (CHI EA ’08). Association for Computing Machinery, New York, NY, USA, 3153–3158. https://doi.org/10.1145/1358628.1358823Google ScholarDigital Library
- Hussain Tinwala and I. Scott MacKenzie. 2010. Eyes-Free Text Entry with Error Correction on Touchscreen Mobile Devices. In Proceedings of the 6th Nordic Conference on Human-Computer Interaction: Extending Boundaries(Reykjavik, Iceland) (NordiCHI ’10). Association for Computing Machinery, New York, NY, USA, 511–520. https://doi.org/10.1145/1868914.1868972Google ScholarDigital Library
- International Telecommunication Union. 2012. Recommendation ITU-R BS.1770-3: Algorithms to Measure Audio Programme Loudness and True-Peak Audio Level. www.itu.int/dms_pubrec/itu-r/rec/bs/R-REC-BS.1770-3-201208-S!!PDF-E.pdfGoogle Scholar
- M.P.J. van der Loo. 2014. The stringdist package for approximate string matching. The R Journal 6(2014), 111–122. Issue 1. https://CRAN.R-project.org/package=stringdistGoogle ScholarCross Ref
- Keith Vertanen and Per Ola Kristensson. 2011. A Versatile Dataset for Text Entry Evaluations Based on Genuine Mobile Emails. In Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services (Stockholm, Sweden) (MobileHCI ’11). Association for Computing Machinery, New York, NY, USA, 295–298. https://doi.org/10.1145/2037373.2037418Google ScholarDigital Library
- Keith Vertanen, Haythem Memmi, Justin Emge, Shyam Reyal, and Per Ola Kristensson. 2015. VelociTap: Investigating Fast Mobile Text Entry Using Sentence-Based Decoding of Touchscreen Keyboard Input. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (Seoul, Republic of Korea) (CHI ’15). Association for Computing Machinery, New York, NY, USA, 659–668. https://doi.org/10.1145/2702123.2702135Google ScholarDigital Library
- David J. Ward, Alan F. Blackwell, and David J. C. MacKay. 2000. Dasher—a Data Entry Interface Using Continuous Gestures and Language Models. In Proceedings of the 13th Annual ACM Symposium on User Interface Software and Technology(San Diego, California, USA) (UIST ’00). Association for Computing Machinery, New York, NY, USA, 129–137. https://doi.org/10.1145/354401.354427Google ScholarDigital Library
- Tetsuya Watanabe, Hirotsugu Kaga, Makoto Kobayashi, and Kazunori Minatani. 2019. Touchscreen Text Entry Methods Used by Blind and Low Vision Users. ITE Transactions on Media Technology and Applications 7, 3(2019), 134–141.Google ScholarCross Ref
- Tetsuya Watanabe, Hirotsugu Kaga, and Shota Shinkai. 2018. Comparison of onscreen text entry methods when using a screen reader. IEICE TRANSACTIONS on Information and Systems 101, 2 (2018), 455–461.Google ScholarCross Ref
- Jacob O. Wobbrock and Brad A. Myers. 2006. Analyzing the Input Stream for Character- Level Errors in Unconstrained Text Entry Evaluations. ACM Trans. Comput.-Hum. Interact. 13, 4 (Dec. 2006), 458–489. https://doi.org/10.1145/1188816.1188819Google ScholarDigital Library
- Jacob O. Wobbrock, Brad A. Myers, and John A. Kembel. 2003. EdgeWrite: A Stylus-Based Text Entry Method Designed for High Accuracy and Stability of Motion. In Proceedings of the 16th Annual ACM Symposium on User Interface Software and Technology (Vancouver, Canada) (UIST ’03). Association for Computing Machinery, New York, NY, USA, 61–70. https://doi.org/10.1145/964696.964703Google ScholarDigital Library
- Patricia Wright, Christine Bartram, Nick Rogers, Hazel Emslie, Jonathan Evans, Barbara Wilson, and Steve Belt. 2000. Text entry on handheld computers by older users. Ergonomics 43, 6 (2000), 702–716. https://doi.org/10.1080/001401300404689 arXiv:https://doi.org/10.1080/001401300404689PMID: 10902881.Google ScholarCross Ref
- Suwen Zhu, Jingjie Zheng, Shumin Zhai, and Xiaojun Bi. 2019. I’sFree: Eyes-Free Gesture Typing via a Touch-Enabled Remote Control. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3290605.3300678Google ScholarDigital Library
Index Terms
- Identifying an Aurally Distinct Phrase Set for Text Entry Techniques
Recommendations
Reviewing Speech Input with Audio: Differences between Blind and Sighted Users
Speech input is a primary method of interaction for blind mobile device users, yet the process of dictating and reviewing recognized text through audio only (i.e., without access to visual feedback) has received little attention. A recent study found ...
Identifying Speech Input Errors Through Audio-Only Interaction
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing SystemsSpeech has become an increasingly common means of text input, from smartphones and smartwatches to voice-based intelligent personal assistants. However, reviewing the recognized text to identify and correct errors is a challenge when no visual feedback ...
Applications in accessibility of text-to-speech synthesis for South African languages: initial system integration and user engagement
SAICSIT '17: Proceedings of the South African Institute of Computer Scientists and Information TechnologistsPersons with certain disabilities face barriers to information access and interpersonal communication. Assistive technologies provide workaround solutions to these problems. Augmentative and alternative communication systems aid the person with little ...
Comments