skip to main content
10.1145/1228716.1228732acmconferencesArticle/Chapter ViewAbstractPublication PageshriConference Proceedingsconference-collections
Article

Improving human-robot interaction through adaptation to the auditory scene

Authors Info & Claims
Published:10 March 2007Publication History

ABSTRACT

Effective communication with a mobile robot using speech is a difficult problem even when you can control the auditory scene. Robot ego-noise, echoes, and human interference are all common sources of decreased intelligibility. In real-world environments, however, these common problems are supplemented with many different types of background noise sources. For instance, military scenarios might be punctuated by high decibel plane noise and bursts from weaponry that mask parts of the speech output from the robot. Even in non-military settings, however, fans, computers, alarms, and transportation noise can cause enough interference that they might render a traditional speech interface unintelligible. In this work, we seek to overcome these problems by applying robotic advantages of sensing and mobility to a text-to-speech interface. Using perspective taking skills to predict how the human user is being affected by new sound sources, a robot can adjust its speaking patterns and/or reposition itself within the environment to limit the negative impact on intelligibility, making a speech interface easier to use.

References

  1. Junqua, J-C. The Lombard Reflex and its Role on Human Listeners and Automatic Speech Recognizers. J. Acoustical Society Of America, 93, 1 (1993), 510--524.Google ScholarGoogle ScholarCross RefCross Ref
  2. Martinson, E. and Brock, D. "Auditory Perspective Taking", Proceeding of the 1st ACM SIGCHI/SIGART conference on Human-robot interaction, Salt Lake City, UT, March 2006, 345--346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Sofge, D., et al., "Collaborating with Humanoid Robots in Space". International Journal of Humanoid Robotics, 2,2 (2005), 181--201.Google ScholarGoogle ScholarCross RefCross Ref
  4. Trafton, J.G., et al., "Enabling effective human-robot interaction using perspective-taking in robots". IEEE Trans. on Systems, Man and Cybernetics, Part A, 35, 4(2005), 460--470. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Hiatt, L., Trafton, J., Harrison, A., Schultz, A. A Cognitive Model for Spatial Perspective Taking. In International Conference on Cognitive Modeling. Mahwah, NJ. 2004, 354--355.Google ScholarGoogle Scholar
  6. Perzanowski, D., et al., Communicating with teams of cooperative robots. In Multi-Robot Systems: From Swarms to Intelligent Automata, A. Schultz and L. Parker, eds. 2002, Kluwer: The Netherlands, 16--20.Google ScholarGoogle Scholar
  7. Brown, G. and Wang, D. "Separation of Speech by Computational Auditory Scene Analysis", Speech Enhancement, J. Benesty, S. Makino and J. Chen (Eds.), Springer, New York, 2005, 371--402.Google ScholarGoogle ScholarCross RefCross Ref
  8. Brock, D.P. and J.A. Ballas. Audio in VR: Beyond entertainment setups and telephones. In Proceedings of International Conference on Human-Computer Interaction. Las Vegas, NV, 2005.Google ScholarGoogle Scholar
  9. Langner, B., and Black, A. Using Speech in Noise to Improve Understandability for Elderly Listeners, ASRU 2005, San Juan, Puerto Rico, 2005, 112--116.Google ScholarGoogle ScholarCross RefCross Ref
  10. D. Pan, B. Heng, S. Cheung, and E. Chang, Improving Speech Synthesis for High Intelligibility under Adverse Conditions. In Proceedings of the 6th International Conference on Spoken Language Processing, Beijing, China, October 2000.Google ScholarGoogle ScholarCross RefCross Ref
  11. Schultz, A., and Adams, W. Continuous localization using evidence grids. In Proceedings of IEEE International Conf. on Robotics and Automation, Leuven, Belgium, 1998, 2833--2839.Google ScholarGoogle ScholarCross RefCross Ref
  12. Yamamoto, S., et al., "Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory". Proceeding of Int. Conf. on Robotics and Automation (ICRA), Barcelona, Spain 2005.Google ScholarGoogle Scholar
  13. G. Bradski, A. Kaehler, and V. Pisarevsky, Learning-based computer vision with intel's open source computer vision library. In Intel Technology Journal, 9,1, (May 2005).Google ScholarGoogle Scholar
  14. Quatiri, T. Discrete Time Speech Signal Processing, Pearson Education, Dehli, India, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. B. Mungamuru and P. Aarabi, Enhanced Sound Localization, IEEE Trans. on Systems, Man, and Cybernetics, 34, 2004, 1526--1540. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Martinson, E. and Schultz, A. "Auditory Evidence Grids," to be published in Proceeding of Int. Conf. on Intelligent Robots and Systems (IROS), Beijing, China 2006.Google ScholarGoogle Scholar
  17. Martinson, E. and Arkin, R. Noise Maps for Acoustically Sensitive Navigation. Proceedings of SPIE, 5609 (December 2004),50--60.Google ScholarGoogle ScholarCross RefCross Ref
  18. K. Hughes, A. Tokuta, and N. Ranganathan, "Trulla: An Agorithm for Path Planning Among Weighted Regions by Localized Propogations," Proceedings of Int. Conf. on Intelligent Robots and Systems (IROS), Raleigh, NC, 1992.Google ScholarGoogle Scholar

Index Terms

  1. Improving human-robot interaction through adaptation to the auditory scene

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      HRI '07: Proceedings of the ACM/IEEE international conference on Human-robot interaction
      March 2007
      392 pages
      ISBN:9781595936172
      DOI:10.1145/1228716

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 March 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      HRI '07 Paper Acceptance Rate22of101submissions,22%Overall Acceptance Rate242of1,000submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader