Research Article
BibTex RIS Cite

NATURAL LANGUAGE QUESTION ANSWERING SYSTEM OVER LINKED DATA

Year 2019, Volume: 20 Issue: 3, 274 - 295, 26.09.2019
https://doi.org/10.18038/estubtda.624498

Abstract



Linked Data project is aimed to give
more details on any subject through the big knowledge bases defined on the web.
In this context, knowledge bases offer endpoint service user interfaces to
query their data. Because of the SPARQL query language limitation of these
knowledge bases, a significant number of web users are unable to benefit from
these services. In this paper, an English natural language question answering
system over Linked Data is proposed in order to eliminate this limitation. The proposed
system's main processes can be listed as follows: (1) Extracting Part-Of-Speech
(POS) tags, (2) pattern extraction & preparing appropriate SPARQL queries,
(3) executing user queries & displaying the results. The features which are
not provided by the endpoint services of knowledge bases such as dynamic
paging, voice search and answer vocalization which make the usage of the proposed system to be possible by the visually-impaired
web users, question-answer caching, social media integration, and live spell
checking are proposed. According to experimental results, the proposed system’s
question answering performance is improved between 2 and 12 times through the
type of natural language question thanks to the question-answer caching
mechanism.

References

  • [1] Berners-Lee T, Hendler J, Lassila O. The Semantic Web. Sci Am Citeseer 2001; 284(5):34–43.
  • [2] Metz C. Tim, Lucy, and The Semantic Web. PC Mag 2007.
  • [3] Rapoza J. SPARQL Will Make the Web Shine. eweek 2006.
  • [4] Segaran T, Evans C, Taylor J. Programming the Semantic Web. 1st ed. Sebastopol, CA, USA: O’Reilly Media, 2009.
  • [5] Bizer C, Heath T, Berners-Lee T. Linked data-the story so far. Int J Semant Web Inf Syst 2009; 5 (3):1–22.
  • [6] Doncel VR. How O is the LOD cloud? 2015. Available at: http://www.cosasbuenas.es/blog/how-o-is-lod-2015; Accessed: 10-Jun-2018.
  • [7] State of the LOD Cloud. University of Mannheim 2015. Available at: http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/; Accessed: 12-May-2019.
  • [8] Linked Open Data cloud diagram as of March 2019. Linked Open Cloud 2019. Available at: https://lod-cloud.net/versions/2019-03-29/lod-cloud.png; Accessed: 12-May-2019.
  • [9] Lehmann J, Schüppel J, Auer S. Discovering Unknown Connections – the DBpedia Relationship Finder. In: Proceedings of the 1st SABRE Conference on Social Semantic Web - CSSW’07; 26-28 September 2007; Leipzig, Germany.
  • [10] Kabakus AT, Dogdu E. Question Answering System over Linked Data. In: 7th National Software Engineering UYMS’13; 26 September 2013; İzmir, Turkey.
  • [11] Bizer C, Lehmann J, Kobilarov G, et al. DBpedia - A crystallization point for the Web of Data. Web Semant Sci Serv Agents World Wide Web 2009; 7 (3):154–65.
  • [12] About | DBpedia. DBpedia 2019. Available at: https://wiki.dbpedia.org/about; Accessed: 12-May-2019.
  • [13] Why do people use the Internet? addictionblog 2011. Available at: http://internet.addictionblog.org/why-do-people-use-the-internet-10-reasons/; Accessed: 12-May-2019.
  • [14] Weider K. 15 Reasons Why People Use The Internet. Weider Web Solutions 2016. Available at: https://www.weiderweb.com/15-reasons-why-people-use-the-internet-and-how-to-use-that-to-your-advantage/; Accessed: 12-May-2019.
  • [15] New AOL Content Research: How Eight Moments Define Behavior Around the World. AOL 2016. Available at: https://advertising.aol.com/en/blog/new-aol-content-research-how-eight-moments-define-behavior-around-world; Accessed: 12-May-2019.
  • [16] Ostuni VC, Gentile G, Noia T Di, et al. Mobile Movie Recommendations with Linked Data. In: CD-ARES 2013: Availability, Reliability, and Security in Information Systems and HCI; 2-6 September 2013; Regensburg, Germany.
  • [17] Ell B, Vrandecic D, Simperl E. SPARTIQULATION: Verbalizing SPARQL queries. In: The Semantic Web: ESWC 2012 Satellite Events; May 27-31 2012; Heraklion, Greece.
  • [18] Lopez V, Fernández M, Motta E, et al. PowerAqua: Supporting users in querying and exploring the Semantic Web. Semant Web 2012; 3 (3):249–65.
  • [19] Cabrio E, Cojan J, Aprosio AP, et al. QAKiS: An open domain QA system based on relational patterns. In: Proceedings of the ISWC 2012 Posters & Demonstrations Track, volume 914 of CEUR Workshop Proceedings; 11-15 November 2012; Boston, MA, USA.
  • [20] Damljanovic D, Agatonovic M, Cunningham H. FREyA: an Interactive Way of Querying Linked Data using Natural Language. In: Proceedings of the Workshop on Question Answering Over Linked Data (QALD); 8-9 October 2011; Heraklion, Greece.
  • [21] Ferrucci D, Brown E, Chu-Carroll J, et al. Building Watson: An Overview of the DeepQA Project. AI Magazine 2010; 31 (3):59–79.
  • [22] Watson – A System Designed for Answers. IBM Syst Technol 2011.
  • [23] Unger C, Bühmann L. Template-based question answering over RDF data. In: Proceedings of the 21st international conference on World Wide Web; 16-20 April 2012; Lyon, France.
  • [24] Unger C, Cimiano P. Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web. In: Munoz, Rafael, Montoyo, Andres, Metais, Elisabeth, editors. Natural Language Processing and Information Systems. Berlin, Heidelberg, Germany: Springer International Publishing, 2011. pp. 153–160.
  • [25] Cimiano P, Haase P, Heizmann J, et al. Towards portable natural language interfaces to knowledge bases - The case of the ORAKEL system. Data Knowl Eng 2008; 65 (2):325–54.
  • [26] Unger C, Freitas A, Cimiano P. An Introduction to Question Answering over Linked Data. In: Reasoning Web 2014: Reasoning Web. Reasoning on the Web in the Big Data Era; 8-13 September 2014; Athens, Greece.
  • [27] Yahya M, Berberich K, Elbassuoni S, et al. Natural language questions for the web of data. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL’12); 12-14 July 2012; Jeju Island, Korea.
  • [28] Tran T, Wang H, Rudolph S, et al. Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data. In: 2009 IEEE 25th International Conference on Data Engineering; 29 March-2 April 2009; Shanghai, China.
  • [29] Shekarpour S, Ngonga Ngomo A-C, Auer S. Question answering on interlinked data. In: Proceedings of the 22nd international conference on World Wide Web - WWW ’13; 13-17 May 2013; Rio de Janeiro, Brazil.
  • [30] Schmid H. Penn Treebank P.O.S. Tags. University of Stuttgart 2003. Available at: https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html; Accessed: 12-May-2019.
  • [31] Leavitt N. Will NoSQL Databases Live Up to Their Promise? Computer (Long Beach Calif) 2010; 43 (2):12–14.
  • [32] Okman L, Gal-Oz N, Gonen Y, et al. Security Issues in NoSQL Databases. In: 2011 IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications; 16-18 November 2011; Changsha, China.
  • [33] Hasan SS, Isaac RK. An integrated approach of MAS-CommonKADS, Model–View–Controller and web application optimization strategies for web-based expert system development. Expert Syst Appl 2011; 38 (1):417–28.
  • [34] Varma V. Software Architecture: A Case Based Approach. New Delhi, India: Pearson Education India, 2009.
  • [35] One minute social media infographic. Social Jumpstart 2012. Available at: http://socialmediachimps.com/wp-content/uploads/2012/03/one-minute-social-media-infographic.jpg; Accessed: 12-May-2019.
  • [36] Infographic: How often people interact online. Go-Globe 2013. Available at: https://techmarketingbuffalo.com/wp-content/uploads/2013/06/infographic-how-often-people-interact-online.jpg; Accessed: 12-May-2019.
  • [37] Wickre K. Celebrating #Twitter7. Twitter 2013[Online] 2013 [cited 2019]. Available at: https://blog.twitter.com/official/en_us/a/2013/celebrating-twitter7.html; Accessed: 12-May-2019.
Year 2019, Volume: 20 Issue: 3, 274 - 295, 26.09.2019
https://doi.org/10.18038/estubtda.624498

Abstract

References

  • [1] Berners-Lee T, Hendler J, Lassila O. The Semantic Web. Sci Am Citeseer 2001; 284(5):34–43.
  • [2] Metz C. Tim, Lucy, and The Semantic Web. PC Mag 2007.
  • [3] Rapoza J. SPARQL Will Make the Web Shine. eweek 2006.
  • [4] Segaran T, Evans C, Taylor J. Programming the Semantic Web. 1st ed. Sebastopol, CA, USA: O’Reilly Media, 2009.
  • [5] Bizer C, Heath T, Berners-Lee T. Linked data-the story so far. Int J Semant Web Inf Syst 2009; 5 (3):1–22.
  • [6] Doncel VR. How O is the LOD cloud? 2015. Available at: http://www.cosasbuenas.es/blog/how-o-is-lod-2015; Accessed: 10-Jun-2018.
  • [7] State of the LOD Cloud. University of Mannheim 2015. Available at: http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/; Accessed: 12-May-2019.
  • [8] Linked Open Data cloud diagram as of March 2019. Linked Open Cloud 2019. Available at: https://lod-cloud.net/versions/2019-03-29/lod-cloud.png; Accessed: 12-May-2019.
  • [9] Lehmann J, Schüppel J, Auer S. Discovering Unknown Connections – the DBpedia Relationship Finder. In: Proceedings of the 1st SABRE Conference on Social Semantic Web - CSSW’07; 26-28 September 2007; Leipzig, Germany.
  • [10] Kabakus AT, Dogdu E. Question Answering System over Linked Data. In: 7th National Software Engineering UYMS’13; 26 September 2013; İzmir, Turkey.
  • [11] Bizer C, Lehmann J, Kobilarov G, et al. DBpedia - A crystallization point for the Web of Data. Web Semant Sci Serv Agents World Wide Web 2009; 7 (3):154–65.
  • [12] About | DBpedia. DBpedia 2019. Available at: https://wiki.dbpedia.org/about; Accessed: 12-May-2019.
  • [13] Why do people use the Internet? addictionblog 2011. Available at: http://internet.addictionblog.org/why-do-people-use-the-internet-10-reasons/; Accessed: 12-May-2019.
  • [14] Weider K. 15 Reasons Why People Use The Internet. Weider Web Solutions 2016. Available at: https://www.weiderweb.com/15-reasons-why-people-use-the-internet-and-how-to-use-that-to-your-advantage/; Accessed: 12-May-2019.
  • [15] New AOL Content Research: How Eight Moments Define Behavior Around the World. AOL 2016. Available at: https://advertising.aol.com/en/blog/new-aol-content-research-how-eight-moments-define-behavior-around-world; Accessed: 12-May-2019.
  • [16] Ostuni VC, Gentile G, Noia T Di, et al. Mobile Movie Recommendations with Linked Data. In: CD-ARES 2013: Availability, Reliability, and Security in Information Systems and HCI; 2-6 September 2013; Regensburg, Germany.
  • [17] Ell B, Vrandecic D, Simperl E. SPARTIQULATION: Verbalizing SPARQL queries. In: The Semantic Web: ESWC 2012 Satellite Events; May 27-31 2012; Heraklion, Greece.
  • [18] Lopez V, Fernández M, Motta E, et al. PowerAqua: Supporting users in querying and exploring the Semantic Web. Semant Web 2012; 3 (3):249–65.
  • [19] Cabrio E, Cojan J, Aprosio AP, et al. QAKiS: An open domain QA system based on relational patterns. In: Proceedings of the ISWC 2012 Posters & Demonstrations Track, volume 914 of CEUR Workshop Proceedings; 11-15 November 2012; Boston, MA, USA.
  • [20] Damljanovic D, Agatonovic M, Cunningham H. FREyA: an Interactive Way of Querying Linked Data using Natural Language. In: Proceedings of the Workshop on Question Answering Over Linked Data (QALD); 8-9 October 2011; Heraklion, Greece.
  • [21] Ferrucci D, Brown E, Chu-Carroll J, et al. Building Watson: An Overview of the DeepQA Project. AI Magazine 2010; 31 (3):59–79.
  • [22] Watson – A System Designed for Answers. IBM Syst Technol 2011.
  • [23] Unger C, Bühmann L. Template-based question answering over RDF data. In: Proceedings of the 21st international conference on World Wide Web; 16-20 April 2012; Lyon, France.
  • [24] Unger C, Cimiano P. Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web. In: Munoz, Rafael, Montoyo, Andres, Metais, Elisabeth, editors. Natural Language Processing and Information Systems. Berlin, Heidelberg, Germany: Springer International Publishing, 2011. pp. 153–160.
  • [25] Cimiano P, Haase P, Heizmann J, et al. Towards portable natural language interfaces to knowledge bases - The case of the ORAKEL system. Data Knowl Eng 2008; 65 (2):325–54.
  • [26] Unger C, Freitas A, Cimiano P. An Introduction to Question Answering over Linked Data. In: Reasoning Web 2014: Reasoning Web. Reasoning on the Web in the Big Data Era; 8-13 September 2014; Athens, Greece.
  • [27] Yahya M, Berberich K, Elbassuoni S, et al. Natural language questions for the web of data. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL’12); 12-14 July 2012; Jeju Island, Korea.
  • [28] Tran T, Wang H, Rudolph S, et al. Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data. In: 2009 IEEE 25th International Conference on Data Engineering; 29 March-2 April 2009; Shanghai, China.
  • [29] Shekarpour S, Ngonga Ngomo A-C, Auer S. Question answering on interlinked data. In: Proceedings of the 22nd international conference on World Wide Web - WWW ’13; 13-17 May 2013; Rio de Janeiro, Brazil.
  • [30] Schmid H. Penn Treebank P.O.S. Tags. University of Stuttgart 2003. Available at: https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html; Accessed: 12-May-2019.
  • [31] Leavitt N. Will NoSQL Databases Live Up to Their Promise? Computer (Long Beach Calif) 2010; 43 (2):12–14.
  • [32] Okman L, Gal-Oz N, Gonen Y, et al. Security Issues in NoSQL Databases. In: 2011 IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications; 16-18 November 2011; Changsha, China.
  • [33] Hasan SS, Isaac RK. An integrated approach of MAS-CommonKADS, Model–View–Controller and web application optimization strategies for web-based expert system development. Expert Syst Appl 2011; 38 (1):417–28.
  • [34] Varma V. Software Architecture: A Case Based Approach. New Delhi, India: Pearson Education India, 2009.
  • [35] One minute social media infographic. Social Jumpstart 2012. Available at: http://socialmediachimps.com/wp-content/uploads/2012/03/one-minute-social-media-infographic.jpg; Accessed: 12-May-2019.
  • [36] Infographic: How often people interact online. Go-Globe 2013. Available at: https://techmarketingbuffalo.com/wp-content/uploads/2013/06/infographic-how-often-people-interact-online.jpg; Accessed: 12-May-2019.
  • [37] Wickre K. Celebrating #Twitter7. Twitter 2013[Online] 2013 [cited 2019]. Available at: https://blog.twitter.com/official/en_us/a/2013/celebrating-twitter7.html; Accessed: 12-May-2019.
There are 37 citations in total.

Details

Primary Language English
Subjects Engineering
Journal Section Articles
Authors

Abdullah Talha Kabakuş 0000-0003-2181-4292

Aydın Çetin 0000-0002-8669-823X

Publication Date September 26, 2019
Published in Issue Year 2019 Volume: 20 Issue: 3

Cite

AMA Kabakuş AT, Çetin A. NATURAL LANGUAGE QUESTION ANSWERING SYSTEM OVER LINKED DATA. Eskişehir Technical University Journal of Science and Technology A - Applied Sciences and Engineering. September 2019;20(3):274-295. doi:10.18038/estubtda.624498