skip to main content
10.1145/2806416.2806490acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Assessing the Impact of Syntactic and Semantic Structures for Answer Passages Reranking

Published:17 October 2015Publication History

ABSTRACT

In this paper, we extensively study the use of syntactic and semantic structures obtained with shallow and deeper syntactic parsers in the answer passage reranking task. We propose several dependency-based structures enriched with Linked Open Data (LD) knowledge for representing pairs of questions and answer passages. We use such tree structures in learning to rank (L2R) algorithms based on tree kernel. The latter can represent questions and passages in a tree fragment space, where each substructure represents a powerful syntactic/semantic feature. Additionally since we define links between structures, tree kernels also generate relational features spanning question and passage structures. We derive very important findings, which can be useful to build state-of-the-art systems: (i) full syntactic dependencies can outperform shallow models also using external knowledge and (ii) the semantic information should be derived by effective and high-coverage resources, e.g., LD, and incorporated in syntactic structures to be effective. We demonstrate our findings by carrying out an extensive comparative experimentation on two different TREC QA corpora and one community question answer dataset, namely Answerbag. Our comparative analysis on well-defined answer selection benchmarks consistently demonstrates that our structural semantic models largely outperform the state of the art in passage reranking.

References

  1. E. Aktolga, J. Allan, and D. A. Smith. Passage reranking for question answering using syntactic structures and answer types. In ECIR, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. W. Bilotti, J. L. Elsas, J. Carbonell, and E. Nyberg. Rank learning for factoid question answering with linguistic and semantic constraints. In CIKM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. W. Bilotti and E. Nyberg. Improving text retrieval precision and answer accuracy in question answering systems. In (IR4QA) Workshop at COLING, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Bizer, J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak, and S. Hellmann. Dbpedia - a crystallization point for the web of data. Web Semantics, 7(3), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Bizer and A. Schultz. The berlin sparql benchmark. IJSWIS, 5(2):1--24, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  6. B. Bohnet. Top accuracy and fast dependency parsing is not a contradiction. In COLING, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. D. Choi and M. Palmer. Getting the most out of transition-based dependency parsing. In ACL, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Croce, A. Moschitti, and R. Basili. Structured lexical similarity via convolution kernels on dependency trees. In Proceedings of EMNLP, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Csomai and R. Mihalcea. Linking documents to encyclopedic knowledge. IEEE Intelligent Systems, 23(5):34--41, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R. E. de Castilho and I. Gurevych. A broad-coverage collection of portable NLP components for building shareable analysis pipelines. In OIAF4HLT Workshop (COLING), 2014.Google ScholarGoogle ScholarCross RefCross Ref
  11. J. Fan, D. Ferrucci, D. Gondek, and A. Kalyanpur. Prismatic: Inducing knowledge from a large scale lexicalized relation resource. In NAACL HLT, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Fellbaum, editor. WordNet: An Electronic Lexical Database. MIT Press, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  13. D. Ferrucci, E. Brown, J. Chu-Carroll, J. Fan, D. Gondek, A. Kalyanpur, A. Lally, J. W. Murdock, E. Nyberg, J. Prager, N. Schlaefer, and C. Welty. Building Watson: An Overview of the DeepQA Project. AI Magazine, 31(3), 2010.Google ScholarGoogle Scholar
  14. M. Heilman and N. A. Smith. Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In NAACL, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Hickl, J. Williams, J. Bensley, K. Roberts, Y. Shi, and B. Rink. Question answering with lcc chaucer at trec 2006. In TREC, 2006.Google ScholarGoogle Scholar
  16. J. Hoffart, F. Suchanek, K. Berberich, and G. Weikum. Yago2: a spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. T. Joachims. Optimizing search engines using clickthrough data. In ACM SIGKDD KDD, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Katz and J. Lin. Selectively using relations to improve precision in question answering. In Workshop on NLP for QA (EACL), 2003.Google ScholarGoogle Scholar
  20. X. Li and D. Roth. Learning question classifiers. In Proceedings of ACL, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. The Stanford CoreNLP natural language processing toolkit. In ACL, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  22. M. C. McCord, J. W. Murdock, and B. Boguraev. Deep parsing in Watson. IBM Journal, 56(3), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Milne and I. Witten. An open-source toolkit for mining wikipedia. In NZCSRSC, volume 9, 2009.Google ScholarGoogle Scholar
  24. A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In ECML, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryigit, S. Kübler, S. Marinov, and E. Marsi. Maltparser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  26. M. Pasca. Open-Domain Question Answering from Large Text Collections. CSLI Publications, 2003.Google ScholarGoogle Scholar
  27. V. Punyakanok and D. Roth. The use of classifiers in sequential inference. In NIPS, pages 995--1001, 2001.Google ScholarGoogle Scholar
  28. F. Radlinski and T. Joachims. Query chains: Learning to rank from implicit feedback. CoRR, 2006.Google ScholarGoogle Scholar
  29. A. Severyn and A. Moschitti. Automatic feature engineering for answer selection and extraction. In EMNLP, 2013.Google ScholarGoogle Scholar
  30. A. Severyn and A. Moschitti. Learning to rank short text pairs with convolutional deep neural networks. In SIGIR, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. Severyn, M. Nicosia, and A. Moschitti. Building structures from classifiers for passage reranking. In CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. Severyn, M. Nicosia, and A. Moschitti. Learning adaptable patterns for passage reranking. CoNLL-2013, page 75, 2013.Google ScholarGoogle Scholar
  33. J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge University Press, 2004. Google ScholarGoogle ScholarCross RefCross Ref
  34. D. Shen and M. Lapata. Using semantic roles to improve question answering. In EMNLP-CoNLL, 2007.Google ScholarGoogle Scholar
  35. S. Small, T. Strzalkowski, T. Liu, S. Ryan, R. Salkin, N. Shimizu, P. Kantor, D. Kelly, and N. Wacholder. Hitiqa: Towards analytical question answering. In COLING, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M. Surdeanu, M. Ciaramita, and H. Zaragoza. Learning to rank answers on large online QA collections. In ACL, 2008.Google ScholarGoogle Scholar
  37. K. Tymoshenko, A. Moschitti, and A. Severyn. Encoding semantic resources in syntactic structures for passage reranking. In Proceedings of EACL, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  38. E. M. Voorhees. Overview of the TREC 2001 Question Answering Track. In Proceedings of TREC, 2001.Google ScholarGoogle Scholar
  39. E. M. Voorhees. Overview of the trec 2003 question answering track. In TREC 2003, 2003.Google ScholarGoogle Scholar
  40. D. Wang and E. Nyberg. A long short-term memory model for answer sentence selection in question answering. In ACL, July 2015.Google ScholarGoogle ScholarCross RefCross Ref
  41. M. Wang and C. D. Manning. Probabilistic tree-edit models with structured latent variables for textual entailment and question answering. In ACL, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. M. Wang, N. A. Smith, and T. Mitamura. What is the jeopardy model? a quasi-synchronous grammar for qa. In EMNLP-CoNLL, 2007.Google ScholarGoogle Scholar
  43. P. C. Xuchen Yao, Benjamin Van Durme and C. Callison-Burch. Answer extraction as sequence tagging with tree edit distance. In NAACL, 2013.Google ScholarGoogle Scholar
  44. X. Yao, B. Van Durme, and P. Clark. Answer extraction as sequence tagging with tree edit distance. In Proceedings of NAACL-HLT, pages 858--867, 2013.Google ScholarGoogle Scholar
  45. W.-t. Yih, M.-W. Chang, C. Meek, and A. Pastusiak. Question answering using enhanced lexical semantic models. In ACL, 2013.Google ScholarGoogle Scholar

Index Terms

  1. Assessing the Impact of Syntactic and Semantic Structures for Answer Passages Reranking

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
        October 2015
        1998 pages
        ISBN:9781450337946
        DOI:10.1145/2806416

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 October 2015

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        CIKM '15 Paper Acceptance Rate165of646submissions,26%Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader