ABSTRACT
Most commercial software applications are designed for a single user working with a keyboard and mouse at an upright monitor. Our interest is in exploiting these systems so that they work over a digital table. Mirroring what people do when working over traditional tables, we want to let multiple people interact naturally with the tabletop application and with each other via rich speech and hand gestures. In previous papers, we illustrated multi-user gesture and speech interaction on a digital table for geospatial and gaming applications -- Google Earth, Warcraft III and The Sims. In this paper, we describe the underlying architecture: GSI Demo. First, GSI Demo creates a run-time wrapper around an existing single-user application: it accepts speech and gestures from multiple people and translates them into the single stream of keyboard and mouse inputs that the application recognizes. Second, it lets people use multimodal demonstration -- instead of programming -- to quickly map their own speech and gestures to these keyboard/mouse inputs. For example, a continuous gesture is trained by saying "Computer, when I do [one-finger gesture], you do [mouse drag]". Similarly, a discrete speech command is trained by saying "Computer, when I say [layer bars], you do [keyboard and mouse macro]". The end result is that end users can rapidly transform single-user commercial applications into multi-user, multimodal digital tabletop systems.
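Purely as an illustrative sketch, the Python below mocks up the two mechanisms the abstract describes: a wrapper that serializes multi-user gesture and speech events into one keyboard/mouse input stream, and demonstration-style training that records which synthesized input a gesture or spoken phrase should replay. All names here (SingleUserWrapper, train_gesture, MouseAction, and so on) are hypothetical stand-ins, not the authors' implementation, which drives a live application's actual input queue.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

@dataclass
class GestureEvent:
    user_id: int   # which person at the table produced the gesture
    name: str      # recognizer label, e.g. "one_finger"
    x: float       # table coordinates, normalized to [0, 1]
    y: float

@dataclass
class MouseAction:
    kind: str      # "move", "down", or "up"
    x: float
    y: float

class SingleUserWrapper:
    """Serializes multi-user gesture/speech events into one input stream.

    The wrapped single-user application understands only one keyboard
    and mouse, so every user's events funnel into self.output; a real
    wrapper would inject them into the application's input queue.
    """

    def __init__(self) -> None:
        self.gesture_map: Dict[str, str] = {}               # gesture name -> mouse action kind
        self.speech_map: Dict[str, List[MouseAction]] = {}  # phrase -> recorded macro
        self.output: List[Tuple[int, MouseAction]] = []     # (user_id, synthesized input)

    # "Computer, when I do [gesture], you do [mouse drag]"
    def train_gesture(self, gesture_name: str, action_kind: str) -> None:
        self.gesture_map[gesture_name] = action_kind

    # "Computer, when I say [phrase], you do [keyboard and mouse macro]"
    def train_speech(self, phrase: str, macro: List[MouseAction]) -> None:
        self.speech_map[phrase] = list(macro)

    def on_gesture(self, ev: GestureEvent) -> None:
        kind = self.gesture_map.get(ev.name)
        if kind is not None:
            self.output.append((ev.user_id, MouseAction(kind, ev.x, ev.y)))

    def on_speech(self, user_id: int, phrase: str) -> None:
        for action in self.speech_map.get(phrase, []):
            self.output.append((user_id, action))

if __name__ == "__main__":
    w = SingleUserWrapper()
    w.train_gesture("one_finger", "move")    # continuous gesture -> mouse drag
    w.train_speech("layer bars",             # discrete command -> replayed macro
                   [MouseAction("down", 0.1, 0.1), MouseAction("up", 0.3, 0.1)])
    w.on_gesture(GestureEvent(user_id=1, name="one_finger", x=0.4, y=0.6))
    w.on_speech(user_id=2, phrase="layer bars")
    print(w.output)
```

The design point mirrored here is the serialization step: because the wrapped application expects exactly one keyboard and mouse, concurrent gestures from different people must be reduced to a single ordered event stream before they reach the application.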