Abstract
The effects of message type (navigation, E-mail, news story), voice type (text-to-speech, natural human speech), and earcon cueing (present, absent) on message comprehension and driving performance were examined. Twenty-four licensed drivers (12 under 30, 12 over 65, both equally divided by gender) participated in the experiment. They drove the UMTRI driving simulator on a road consisting of straight sections and constant radius curves, thus yielding two levels of low driving-workload. In addition, as a control condition, data were collected while participants were parked. In all conditions, participants were presented with three types of messages. Each message was immediately followed by a series of questions to assess comprehension. Navigation messages were about 4 seconds long (about 9 words). E-mail messages were about 40 seconds long (about 100 words) and news messages were about 80 seconds long (about 225 words). For all message types, comprehension of text-to-speech messages, as determined by accuracy of response to questions, and by subjective ratings, was significantly worse than comprehension of natural speech (79 versus 83 percent correct answers; 7.7/10 versus 8.6/10 subjective rating). Driving workload did not affect comprehension. Interestingly, neither the speech used (synthesized or natural) nor the message type (navigation, E-mail, news) had a significant effect on basic driving performance measured by the standard deviations of lateral lane position and steering wheel angle.
Similar content being viewed by others
References
Belz, S.M., Winters, J.J., Robinson, G.S., and Casali, J.G. (1997). Auditory icons: A new class of transportation subsystem(SAE paper 973185), Warrendale, PA: Society of Automotive Engineers.
Brown, I.D. (1965). Effect of a car radio on driving in traffic.Ergonomics, 8(4):475-479.
Bruno, A. (1999). Auto industry drives telematic services.RCRRadio Communications Report, 18(37):40-41.
Fleming, J., Green, P., and Katz, S. (1998). Driver performance and memory for traffic messages: Effects of the number of messages, audio quality, and relevance (Technical Report UMTRI-98-22). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Francis, A.L. and Nusbaum, H.C. (1999). Evaluating the quality of synthetic speech. In D. Gardner-Bonneau (Ed.), Human Factors and Voice Interactive Systems. Boston, MA: Kluwer Academic Publishers, pp. 63-97.
Goodman, M., Bents, F.D., Tijerina, L., Wierwille, W., Lerner, N., and Benel, D. (1997). An investigation of the safety implications of wireless communication in vehicles (Technical Report DOT HS 808 635). Washington, D.C.: U.S. Department of Transportation (http://www.nhtsa.dot.gov/people/injury/research/wireless/).
Green, P. (1993). Measures and methods used to assess the safety and usability of driver information systems (Technical Report UMTRI-93-12). Ann Arbor, MI: The University of Michigan Transportation Research Institute (also published as FHWA-RD-94-088, McLean, VA: U.S. Department of Transportation, Federal Highway Administration, August, 1995).
Green, P. (2000a). Dealing with potential distractions from driver information systems. (SAE paper 2000-01-C008) Paper presented at the Convergence 2000 Conference, Dearborn, Michigan, October 16-18, 2000.
Green, P. (2000b). The human interface for ITS display and control systems: Developing international standards to promote safety and usability. Invited paper presented at the InternationalWorkshop on ITS Human Interface in Japan, Utsu, Japan, June 8, 2000.
Green, P. (2001). Variations in task performance between younger and older drivers: UMTRI research on telematics. Paper presented at the Association for the Advancement of Automotive Medicine Conference on Aging and Driving, Southfield, Michigan, February 19, 20, 2001.
Jaencke, L., Musial, F., Wogt, J., and Kalveram, K.T. (1994). Monitoring radio programs and time of day affect simulated car-driving performance. Perceptual & Motor Skills, 79(2):484-486.
Kryter, K.D. (1972). Speech communication. In H.P. Van Cott and R.G. Kinkade (Eds.), Human Engineering Guide to Equipment Design, Washington, DC: US Government Printing Office, pp. 161-226.
Lai, J., Wood, D., and Considine, M. (2000). The effect of task conditions on the comprehensibility of synthetic speech, ACM SIGCHI CHI 2000 Proceedings, pp. 321-328.
Lee, J.D., Caven, B., Haake, S., and Brown, T.L. (submitted toHuman Factors), Speech-based interaction with in-vehicle computers: The effect of speech-based e-mail on drivers attention to the roadway. [http://www-nrd.nhtsa.dot.gov/driver-distraction/PDF/27.PDF].
Logan, J.S., Greene, B.G., and Pisoni, D.B. (1989). Segmental intelligibility of synthetic speech produced by rule. Journal of the Acoustical Society of America, 86(2):566-581.
Morrison, H.B. and Casali, J.G. (1994). Intelligibility of synthesized voice messages in commercial truck cab noise for normal-hearing and hearing-impaired listeners. Proceedings of the 1994 Human Factors and Ergonomics Society 38th Annual Conference. Santa Monica, CA: Human Factors and Ergonomics Society, pp. 801-805.
Morrison, H.B. and Casali, J.G. (1995). Interior noise levels and intelligibility of synthesized speech messages in 1993-vintage commercial truck cabs Proceedings of the 20th Annual National Hearing Conservation Conference III/XX. Cincinnati, OH: The National Hearing Conservation Association, pp. 150-156.
Olson, A. and Green, P. (1997). A description of the UMTRI driving simulator architecture and alternatives (Technical Report UMTRI-97-15). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Richardson, B. and Green, P. (2000). Trends in North American intelligent transportation systems: A year 2000 appraisal (Technical Report 2000-9). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Tsimhoni, O. and Green, P. (1999). Visual Demand of Driving Curves Determined by Visual Occlusion. Paper presented at the Vision in Vehicles 8 Conference. Boston, MA: August 22-25.
Tsimhoni, O., Yoo, H., and Green, P. (1999). Effects of workload and task complexity on driving and task performance for in-vehicle displays as assessed by visual occlusion (Technical Report UMTRI-99-37). Ann Arbor, MI: University of Michigan Transportation Research Institute.
Tsimhoni, O., Green, P., and Lai, J. (2000). Listening to synthetic and natural speech while driving: Effects on user performance (Technical Report UMTRI 2000-31). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Tsimhoni, O., Green, P. & Lai, J. Listening to Natural and Synthesized Speech while Driving: Effects on User Performance. International Journal of Speech Technology 4, 155–169 (2001). https://doi.org/10.1023/A:1011387612112
Issue Date:
DOI: https://doi.org/10.1023/A:1011387612112