Abstract
With the recent spread of speech technologies and the increasing availability of application programming interfaces for speech synthesis and recognition, system designers are starting to consider whether to add speech functionality to their applications. The questions that ensue are by no means trivial. SMALTO, the tool described below, provides advice on the use of speech input and/or output modalities in combination with other modalities in the design of multimodal systems. SMALTO (Speech Modality AuxiLiary TOol) implements a theory of modalities and incorporates structured data extracted from a corpus of claims about speech functionality found in recent literature on multimodality. The current version of the system, implemented as a hypertext system, aims mainly at supporting decisions at early design stages. However, further uses of SMALTO as part of a complete domain-oriented design environment are also envisaged.
Cite this article
Luz, S., Bernsen, N.O. A Tool for Interactive Advice on the Use of Speech in Multimodal Systems. The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology 29, 129–137 (2001). https://doi.org/10.1023/A:1011183800658