Listening to Natural and Synthesized Speech while Driving: Effects on User Performance

International Journal of Speech Technology

Abstract

The effects of message type (navigation, E-mail, news story), voice type (text-to-speech, natural human speech), and earcon cueing (present, absent) on message comprehension and driving performance were examined. Twenty-four licensed drivers (12 under 30 and 12 over 65, each group evenly divided by gender) participated in the experiment. They drove the UMTRI driving simulator on a road consisting of straight sections and constant-radius curves, yielding two levels of low driving workload. As a control condition, data were also collected while participants were parked. In all conditions, participants were presented with the three types of messages, and each message was immediately followed by a series of questions to assess comprehension. Navigation messages were about 4 seconds long (about 9 words), E-mail messages were about 40 seconds long (about 100 words), and news messages were about 80 seconds long (about 225 words). For all message types, comprehension of text-to-speech messages, as determined by the accuracy of responses to the questions and by subjective ratings, was significantly worse than comprehension of natural speech (79 versus 83 percent correct answers; subjective ratings of 7.7/10 versus 8.6/10). Driving workload did not affect comprehension. Interestingly, neither voice type (synthesized or natural) nor message type (navigation, E-mail, news) had a significant effect on basic driving performance, as measured by the standard deviations of lateral lane position and steering wheel angle.
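The two driving-performance measures named in the abstract can be illustrated with a minimal sketch. The sample values, units, function name, and the choice of the sample (n - 1) standard deviation are all assumptions made for illustration; the abstract does not say how the signals were sampled or which estimator was used.

```python
from statistics import stdev


def driving_variability(lane_position_m, steering_angle_deg):
    """Return (SD of lateral lane position, SD of steering wheel angle).

    Both inputs are equal-interval samples from one driving segment.
    The sample standard deviation (n - 1 denominator) is an assumption.
    """
    if len(lane_position_m) < 2 or len(steering_angle_deg) < 2:
        raise ValueError("need at least two samples per signal")
    return stdev(lane_position_m), stdev(steering_angle_deg)


# Hypothetical samples, not data from the study.
lane = [0.10, -0.05, 0.20, 0.00, -0.15]   # metres from lane centre
steer = [1.0, -2.0, 0.5, 0.0, -1.5]       # degrees

sd_lane, sd_steer = driving_variability(lane, steer)
print(f"SD lane position: {sd_lane:.3f} m")
print(f"SD steering wheel angle: {sd_steer:.3f} deg")
```

Larger values of either measure indicate less stable lane keeping or steering, which is why a comparison across voice types and message types can be made on these two numbers.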

Cite this article

Tsimhoni, O., Green, P. & Lai, J. Listening to Natural and Synthesized Speech while Driving: Effects on User Performance. International Journal of Speech Technology 4, 155–169 (2001). https://doi.org/10.1023/A:1011387612112

  • DOI: https://doi.org/10.1023/A:1011387612112
