This chapter presents the QALC system, which participated in the four TREC QA evaluations. We focus on the problem of linguistic variation, which must be handled in order to relate questions to answers. We first present variation at the term level, which consists in retrieving question terms in document sentences even when morphological, syntactic, or semantic variations alter them. We then address variation at the sentence level, which we handle as different partial reformulations of questions: questions are associated with extraction patterns based on the syntactic type of the question and the object being asked about. Finally, we present the whole system, which shows where QALC deals with variation, together with several evaluations.
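As a rough illustration of term-level variation, the sketch below looks for question terms in a sentence while tolerating simple morphological and semantic variants. It is not QALC's implementation: the suffix list, the `SYNONYMS` table, and the names `stem` and `matched_terms` are all hypothetical, and syntactic variation is not covered.

```python
import re

# Hypothetical synonym table standing in for a real semantic resource.
SYNONYMS = {"author": {"writer"}, "build": {"construct"}}

# Hypothetical, deliberately crude suffix list (not a real stemmer).
SUFFIXES = ("ing", "ed", "s")


def stem(word):
    """Lowercase the word and strip one known suffix, keeping at least 3 letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def matched_terms(question_terms, sentence):
    """Map each question term to the variants of it found in the sentence."""
    tokens = re.findall(r"[A-Za-z]+", sentence)
    sentence_stems = {stem(token) for token in tokens}
    found = {}
    for term in question_terms:
        # The term itself plus its assumed semantic variants.
        candidates = {term} | SYNONYMS.get(term, set())
        hits = {c for c in candidates if stem(c) in sentence_stems}
        if hits:
            found[term] = hits
    return found


if __name__ == "__main__":
    print(matched_terms(["author", "build"], "The writer constructed two bridges."))
```

A real system would replace the suffix stripping with a morphological lexicon and the synonym table with a lexical database; the point here is only that a sentence can match a question term without containing its exact surface form.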
© 2008 Springer
About this chapter
Cite this chapter
Grau, B. et al. (2008). Coping With Alternate Formulations Of Questions And Answers. In: Strzalkowski, T., Harabagiu, S.M. (eds) Advances in Open Domain Question Answering. Text, Speech and Language Technology, vol 32. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4746-6_6
DOI: https://doi.org/10.1007/978-1-4020-4746-6_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4744-2
Online ISBN: 978-1-4020-4746-6
eBook Packages: Humanities, Social Sciences and Law; Social Sciences (R0)