Listening to Natural and Synthesized Speech while Driving: Effects on User Performance

Tsimhoni, Omer; Green, Paul; Lai, Jennifer

doi:10.1023/A:1011387612112

Listening to Natural and Synthesized Speech while Driving: Effects on User Performance

Published: June 2001

Volume 4, pages 155–169, (2001)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Omer Tsimhoni¹,
Paul Green¹ &
Jennifer Lai²

172 Accesses
10 Citations
Explore all metrics

Abstract

The effects of message type (navigation, E-mail, news story), voice type (text-to-speech, natural human speech), and earcon cueing (present, absent) on message comprehension and driving performance were examined. Twenty-four licensed drivers (12 under 30, 12 over 65, both equally divided by gender) participated in the experiment. They drove the UMTRI driving simulator on a road consisting of straight sections and constant radius curves, thus yielding two levels of low driving-workload. In addition, as a control condition, data were collected while participants were parked. In all conditions, participants were presented with three types of messages. Each message was immediately followed by a series of questions to assess comprehension. Navigation messages were about 4 seconds long (about 9 words). E-mail messages were about 40 seconds long (about 100 words) and news messages were about 80 seconds long (about 225 words). For all message types, comprehension of text-to-speech messages, as determined by accuracy of response to questions, and by subjective ratings, was significantly worse than comprehension of natural speech (79 versus 83 percent correct answers; 7.7/10 versus 8.6/10 subjective rating). Driving workload did not affect comprehension. Interestingly, neither the speech used (synthesized or natural) nor the message type (navigation, E-mail, news) had a significant effect on basic driving performance measured by the standard deviations of lateral lane position and steering wheel angle.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Speech-Based Text Correction Patterns in Noisy Environment

Don’t Text While Driving: The Effect of Smartphone Text Messaging on Road Safety during Simulated Driving

Can User-Paced, Menu-free Spoken Language Interfaces Improve Dual Task Handling While Driving?

References

Belz, S.M., Winters, J.J., Robinson, G.S., and Casali, J.G. (1997). Auditory icons: A new class of transportation subsystem(SAE paper 973185), Warrendale, PA: Society of Automotive Engineers.
Google Scholar
Brown, I.D. (1965). Effect of a car radio on driving in traffic.Ergonomics, 8(4):475-479.
Google Scholar
Bruno, A. (1999). Auto industry drives telematic services.RCRRadio Communications Report, 18(37):40-41.
Google Scholar
Fleming, J., Green, P., and Katz, S. (1998). Driver performance and memory for traffic messages: Effects of the number of messages, audio quality, and relevance (Technical Report UMTRI-98-22). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Google Scholar
Francis, A.L. and Nusbaum, H.C. (1999). Evaluating the quality of synthetic speech. In D. Gardner-Bonneau (Ed.), Human Factors and Voice Interactive Systems. Boston, MA: Kluwer Academic Publishers, pp. 63-97.
Google Scholar
Goodman, M., Bents, F.D., Tijerina, L., Wierwille, W., Lerner, N., and Benel, D. (1997). An investigation of the safety implications of wireless communication in vehicles (Technical Report DOT HS 808 635). Washington, D.C.: U.S. Department of Transportation (http://www.nhtsa.dot.gov/people/injury/research/wireless/).
Google Scholar
Green, P. (1993). Measures and methods used to assess the safety and usability of driver information systems (Technical Report UMTRI-93-12). Ann Arbor, MI: The University of Michigan Transportation Research Institute (also published as FHWA-RD-94-088, McLean, VA: U.S. Department of Transportation, Federal Highway Administration, August, 1995).
Google Scholar
Green, P. (2000a). Dealing with potential distractions from driver information systems. (SAE paper 2000-01-C008) Paper presented at the Convergence 2000 Conference, Dearborn, Michigan, October 16-18, 2000.
Google Scholar
Green, P. (2000b). The human interface for ITS display and control systems: Developing international standards to promote safety and usability. Invited paper presented at the InternationalWorkshop on ITS Human Interface in Japan, Utsu, Japan, June 8, 2000.
Google Scholar
Green, P. (2001). Variations in task performance between younger and older drivers: UMTRI research on telematics. Paper presented at the Association for the Advancement of Automotive Medicine Conference on Aging and Driving, Southfield, Michigan, February 19, 20, 2001.
Jaencke, L., Musial, F., Wogt, J., and Kalveram, K.T. (1994). Monitoring radio programs and time of day affect simulated car-driving performance. Perceptual & Motor Skills, 79(2):484-486.
Google Scholar
Kryter, K.D. (1972). Speech communication. In H.P. Van Cott and R.G. Kinkade (Eds.), Human Engineering Guide to Equipment Design, Washington, DC: US Government Printing Office, pp. 161-226.
Google Scholar
Lai, J., Wood, D., and Considine, M. (2000). The effect of task conditions on the comprehensibility of synthetic speech, ACM SIGCHI CHI 2000 Proceedings, pp. 321-328.
Lee, J.D., Caven, B., Haake, S., and Brown, T.L. (submitted toHuman Factors), Speech-based interaction with in-vehicle computers: The effect of speech-based e-mail on drivers attention to the roadway. [http://www-nrd.nhtsa.dot.gov/driver-distraction/PDF/27.PDF].
Logan, J.S., Greene, B.G., and Pisoni, D.B. (1989). Segmental intelligibility of synthetic speech produced by rule. Journal of the Acoustical Society of America, 86(2):566-581.
Google Scholar
Morrison, H.B. and Casali, J.G. (1994). Intelligibility of synthesized voice messages in commercial truck cab noise for normal-hearing and hearing-impaired listeners. Proceedings of the 1994 Human Factors and Ergonomics Society 38th Annual Conference. Santa Monica, CA: Human Factors and Ergonomics Society, pp. 801-805.
Google Scholar
Morrison, H.B. and Casali, J.G. (1995). Interior noise levels and intelligibility of synthesized speech messages in 1993-vintage commercial truck cabs Proceedings of the 20th Annual National Hearing Conservation Conference III/XX. Cincinnati, OH: The National Hearing Conservation Association, pp. 150-156.
Google Scholar
Olson, A. and Green, P. (1997). A description of the UMTRI driving simulator architecture and alternatives (Technical Report UMTRI-97-15). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Google Scholar
Richardson, B. and Green, P. (2000). Trends in North American intelligent transportation systems: A year 2000 appraisal (Technical Report 2000-9). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Google Scholar
Tsimhoni, O. and Green, P. (1999). Visual Demand of Driving Curves Determined by Visual Occlusion. Paper presented at the Vision in Vehicles 8 Conference. Boston, MA: August 22-25.
Tsimhoni, O., Yoo, H., and Green, P. (1999). Effects of workload and task complexity on driving and task performance for in-vehicle displays as assessed by visual occlusion (Technical Report UMTRI-99-37). Ann Arbor, MI: University of Michigan Transportation Research Institute.
Google Scholar
Tsimhoni, O., Green, P., and Lai, J. (2000). Listening to synthetic and natural speech while driving: Effects on user performance (Technical Report UMTRI 2000-31). Ann Arbor, MI: The University of Michigan Transportation Research Institute.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Michigan Transportation Research Institute, Ann Arbor, MI, 48109-2150, USA
Omer Tsimhoni & Paul Green
IBM Corporation/T.J. Watson Research Center, Hawthorne, NY, 10598, USA
Jennifer Lai

Authors

Omer Tsimhoni
View author publications
You can also search for this author in PubMed Google Scholar
Paul Green
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Lai
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tsimhoni, O., Green, P. & Lai, J. Listening to Natural and Synthesized Speech while Driving: Effects on User Performance. International Journal of Speech Technology 4, 155–169 (2001). https://doi.org/10.1023/A:1011387612112

Download citation

Issue Date: June 2001
DOI: https://doi.org/10.1023/A:1011387612112

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Listening to Natural and Synthesized Speech while Driving: Effects on User Performance

Abstract

Access this article

Similar content being viewed by others

Speech-Based Text Correction Patterns in Noisy Environment

Don’t Text While Driving: The Effect of Smartphone Text Messaging on Road Safety during Simulated Driving

Can User-Paced, Menu-free Spoken Language Interfaces Improve Dual Task Handling While Driving?

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation