Regular Article
Visual Tools for Natural Language Processing

https://doi.org/10.1006/jvlc.2000.0203

Abstract

We describe GATE, the General Architecture for Text Engineering, an integrated visual development environment that supports the assembly, execution and analysis of modular natural language processing systems. The visual model is a data flow program graph, automatically synthesised from the data dependency declarations of language processing modules. The graph is directly executable: modules are run interactively in the graph, and results are accessible via generic text visualisation tools linked to the modules. These tools lighten the ‘cognitive load’ of viewing and comparing module results by relating data produced by modules back to the underlying text, by reducing the amount of search needed to examine results, and by displaying results in context. Overall, the GATE environment leads to rapid understanding of system behaviour and hence to rapid system refinement, demonstrating the utility of visual programming and visualisation techniques for the development of natural language processing systems.
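The idea of synthesising an executable graph from dependency declarations can be illustrated with a minimal sketch. This is not GATE's actual API or data model: the module names, the `requires`/`produces` keys, and the functions `build_graph` and `execution_order` are all hypothetical, chosen only to show how declared inputs and outputs are enough to derive both the edges of a data flow graph and a valid run order.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical declarations (not taken from GATE): each module lists the
# kinds of data it needs and the kinds of data it produces.
MODULES = {
    "tokeniser": {"requires": set(), "produces": {"token"}},
    "sentence_splitter": {"requires": {"token"}, "produces": {"sentence"}},
    "tagger": {"requires": {"token", "sentence"}, "produces": {"pos"}},
    "parser": {"requires": {"pos"}, "produces": {"syntax"}},
}


def build_graph(modules):
    """Map each module to the set of modules whose output it depends on."""
    # Index producers by the data type they emit.
    producers = {}
    for name, decl in modules.items():
        for data_type in decl["produces"]:
            producers.setdefault(data_type, set()).add(name)

    # An edge runs from every producer of a type to every module requiring it.
    graph = {}
    for name, decl in modules.items():
        predecessors = set()
        for data_type in decl["requires"]:
            if data_type not in producers:
                raise ValueError(f"{name!r} requires {data_type!r}, which no module produces")
            predecessors |= producers[data_type]
        graph[name] = predecessors
    return graph


def execution_order(modules):
    """Return one order in which every module runs after its predecessors."""
    return list(TopologicalSorter(build_graph(modules)).static_order())


if __name__ == "__main__":
    print(execution_order(MODULES))
```

In this sketch nobody draws the graph by hand: adding a module with new declarations changes the derived edges automatically, which is the property the abstract attributes to the visual model. A cyclic set of declarations would make `static_order()` raise `graphlib.CycleError`.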
