DOI: 10.1145/2388676.2388744
poster

Timing multimodal turn-taking for human-robot cooperation

Published: 22 October 2012

ABSTRACT

In human cooperation, the concurrent usage of multiple social modalities such as speech, gesture, and gaze results in robust and efficient communicative acts. Such multimodality in combination with reciprocal intentions supports fluent turn-taking. I hypothesize that human-robot turn-taking can be made more fluent through appropriate timing of multimodal actions. Managing timing includes understanding the impact that timing can have on interactions as well as having a control system that supports the manipulation of such timing. To this end, I propose to develop a computational turn-taking model of the timing and information flow of reciprocal interactions. I also propose to develop an architecture based on the timed Petri net (TPN) for the generation of coordinated multimodal behavior, inside of which the turn-taking model will regulate turn timing and action initiation and interruption in order to seize and yield control. Through user studies in multiple domains, I intend to demonstrate the system's generality and evaluate the system on balance of control, fluency, and task effectiveness.
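To make the timed Petri net idea concrete, here is a minimal sketch of one robot turn in which speech and gesture run in parallel and the floor is yielded only after both finish. It is not the architecture proposed in the abstract: the place names, transition names, delays, and the simplified firing rule (a transition fires once it has stayed enabled for a fixed delay) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transition:
    name: str
    inputs: tuple    # places that must each hold at least one token
    outputs: tuple   # places that each receive one token on firing
    delay: float     # seconds the transition must stay enabled before firing


def simulate(marking, transitions, horizon):
    """Fire transitions in time order up to `horizon` seconds.

    Returns the list of (time, transition name) events and the final marking.
    """
    marking = dict(marking)
    clock = 0.0
    enabled_since = {}  # transition name -> time at which it became enabled
    events = []
    while True:
        # Record newly enabled transitions; forget ones that lost a token.
        for t in transitions:
            if all(marking.get(p, 0) > 0 for p in t.inputs):
                enabled_since.setdefault(t.name, clock)
            else:
                enabled_since.pop(t.name, None)
        enabled = [t for t in transitions if t.name in enabled_since]
        if not enabled:
            break
        # The next transition to fire is the one whose delay elapses first.
        due = min(enabled, key=lambda t: enabled_since[t.name] + t.delay)
        fire_time = enabled_since[due.name] + due.delay
        if fire_time > horizon:
            break
        clock = fire_time
        for p in due.inputs:
            marking[p] -= 1
        for p in due.outputs:
            marking[p] = marking.get(p, 0) + 1
        del enabled_since[due.name]
        events.append((clock, due.name))
    return events, marking


# Hypothetical net: the robot holds the floor, speaks and gestures in
# parallel, then yields the floor to the human once both have finished.
net = [
    Transition("start_turn", ("robot_floor",), ("speaking", "gesturing"), 0.0),
    Transition("finish_speech", ("speaking",), ("speech_done",), 2.0),
    Transition("finish_gesture", ("gesturing",), ("gesture_done",), 1.0),
    Transition("yield_floor", ("speech_done", "gesture_done"), ("human_floor",), 0.5),
]

events, final = simulate({"robot_floor": 1}, net, horizon=10.0)
for time, name in events:
    print(f"{time:4.1f}s  {name}")
```

With these assumed delays the gesture ends at 1.0 s, the speech at 2.0 s, and the floor passes to the human at 2.5 s. Changing a single delay shifts when the turn is yielded, which is the kind of timing manipulation the abstract refers to.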

      • Published in

        ICMI '12: Proceedings of the 14th ACM international conference on Multimodal interaction
        October 2012
        636 pages
        ISBN: 9781450314671
        DOI: 10.1145/2388676

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        Overall Acceptance Rate: 453 of 1,080 submissions, 42%
