skip to main content
10.1145/3360901.3364418acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Generating Rules to Filter Candidate Triples for their Correctness Checking by Knowledge Graph Completion Techniques

Published:23 September 2019Publication History

ABSTRACT

Knowledge Graphs (KGs) contain large amounts of structured information. Due to their inherent incompleteness, a process known as KG completion is often carried out to find the missing triples in a KG, usually by training a fact checking model that is able to discern between correct and incorrect knowledge. After the fact checking model has been trained and evaluated, it has to be applied to a set of candidate triples, and those that are considered correct are added to the KG as new knowledge. However, this process needs a set of candidate triples of a reasonable size that represents possible new knowledge, in order to be evaluated by the fact checking task and, if considered to be correct, added to the KG, enriching it. Current approaches for selecting candidate triples for their correctness checking either use the full set possible missing candidate triples (and thus provide no filtering) or apply very basic rules to filter out unlikely candidates, which may have a negative effect on the completion performance as very few candidate triples are filtered out. In this paper we present CHAI, a method for producing more complex rules that are able to filter candidate triples by combining a set of criteria to optimize a fitness function. Our experiments show that CHAI is able to generate rules that, when applied, yield smaller candidate sets than similar proposals while still including promising candidate triples.

References

  1. Daniel Ayala, Agustín Borrego, Inma Hernández, Carlos R. Rivero, and David Ruiz. 2019. AYNEC: All You Need for Evaluating Completion Techniques in Knowledge Graphs. In ESWC. 397--411. https://doi.org/10.1007/978--3-030--21348-0_26Google ScholarGoogle Scholar
  2. Daniel Ayala, Inma Hernández, David Ruiz, and Miguel Toro. 2019. TAPON: A two-phase machine learning approach for semantic labelling. Knowledge-Based Systems 163 (2019), 931--943. https://doi.org/10.1016/j.knosys.2018.10.017Google ScholarGoogle ScholarCross RefCross Ref
  3. Kurt Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge. In SIGMOD. ACM, 1247--1250. https://doi.org/10.1145/1376616. 1376746Google ScholarGoogle Scholar
  4. Antoine Bordes, Sumit Chopra, and JasonWeston. 2014. Question Answering with Subgraph Embeddings. In EMNLP. ACL, 615--620. http://aclweb.org/anthology/ D/D14/D14--1067.pdfGoogle ScholarGoogle Scholar
  5. Antoine Bordes and Evgeniy Gabrilovich. 2014. Constructing and Mining Webscale Knowledge Graphs. In SIGKDD. ACM, 1967--1967. https://doi.org/10.1145/ 2623330.2630803Google ScholarGoogle Scholar
  6. Antoine Bordes, Xavier Glorot, Jason Weston, and Yoshua Bengio. 2014. A semantic matching energy function for learning with multi-relational data - Application to word-sense disambiguation. Machine Learning 94, 2 (2014), 233-- 259. https://doi.org/10.1007/s10994-013--5363--6Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Antoine Bordes, Nicolas Usunier, Alberto García-Durán, Jason Weston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In NIPS. 2787--2795. https://papers.nips.cc/paper/5071-translatingembeddings- for-modeling-multi-relational-dataGoogle ScholarGoogle Scholar
  8. Xin Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Ni Lao, Kevin Murphy, Thomas Strohmann, Shaohua Sun, and Wei Zhang. 2014. Knowledge Vault: A Web-scale Approach to Probabilistic Knowledge Fusion. In SIGKDD. ACM, 601--610. https://doi.org/10.1145/2623330.2623623Google ScholarGoogle Scholar
  9. Sébastien Ferré. 2019. Link Prediction in Knowledge Graphs with Concepts of Nearest Neighbours. In ESWC. Springer International Publishing, 84--100. https://doi.org/10.1007/978--3-030--21348-0_6Google ScholarGoogle Scholar
  10. Luis Galárraga, Christina Teflioudi, Katja Hose, and Fabian M. Suchanek. 2015. Fast rule mining in ontological knowledge bases with AMIE+. VLDB J. 24, 6 (2015), 707--730. https://doi.org/10.1007/s00778-015-0394--1Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Matt Gardner and Tom Mitchell. 2015. Efficient and expressive knowledge base completion using subgraph feature extraction. In EMNLP. The Association for Computational Linguistics, 1488--1498. https://aclweb.org/anthology/D/D15/ D15--1173.pdfGoogle ScholarGoogle Scholar
  12. Matt Gardner, Partha Pratim Talukdar, Jayant Krishnamurthy, and Tom M. Mitchell. 2014. Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases. In EMNLP. 397--406. http://aclweb.org/anthology/D/D14/ D14--1044.pdfGoogle ScholarGoogle Scholar
  13. Michael Glass and Alfio Gliozzo. 2018. A Dataset for Web-Scale Knowledge Base Population. In ESWC. Springer, 256--271. https://doi.org/10.1007/978--3--319- 93417--4_17Google ScholarGoogle Scholar
  14. Vinh Thinh Ho, Daria Stepanova, Mohamed H. Gad-Elrab, Evgeny Kharlamov, and Gerhard Weikum. 2018. Learning Rules from Incomplete KGs using Embeddings. In ISWC, Vol. 2180. http://ceur-ws.org/Vol-2180/paper-25.pdfGoogle ScholarGoogle Scholar
  15. Ni Lao and William W Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine learning 81, 1 (2010), 53--67. https: //doi.org/10.1007/s10994-010--5205--8Google ScholarGoogle Scholar
  16. Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick Van Kleef, Sören Auer, and Christian Bizer. 2015. DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6, 2 (2015), 167--195. https://doi.org/10.3233/SW-140134Google ScholarGoogle ScholarCross RefCross Ref
  17. Peng Lin, Qi Song, and Yinghui Wu. 2018. Fact Checking in Knowledge Graphs with Ontological Subgraph Patterns. Data Science and Engineering 3 (2018), 341--358. https://doi.org/10.1007/s41019-018-0082--4Google ScholarGoogle ScholarCross RefCross Ref
  18. Sahisnu Mazumder and Bing Liu. 2017. Context-aware Path Ranking for Knowledge Base Completion. In IJCAI. AAAI Press, 1195--1201. https://doi.org/10. 24963/ijcai.2017/166Google ScholarGoogle Scholar
  19. Christian Meilicke, Melisachew Wudage Chekol, Daniel Ruffinelli, and Heiner Stuckenschmidt. 2019. Anytime Bottom-Up Rule Learning for Knowledge Graph Completion. In IJCAI. ijcai.org, 3137--3143. https://doi.org/10.24963/ijcai.2019/435Google ScholarGoogle Scholar
  20. George A. Miller. 1995. WordNet: A Lexical Database for English. Commun. ACM 38, 11 (1995), 39--41. https://doi.org/10.1145/219717.219748Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. T. Mitchell,W. Cohen, E. Hruschka, P. Talukdar, B. Yang, J. Betteridge, A. Carlson, B. Dalvi, M. Gardner, B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed, N. Nakashole, E. Platanios, A. Ritter, M. Samadi, B. Settles, R. Wang, D. Wijaya, A. Gupta, X. Chen, A. Saparov, M. Greaves, and J. Welling. 2018. Never-ending Learning. Commun. ACM 61, 5 (2018), 103--115. https://doi.org/10.1145/3191513Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Maximilian Nickel, Volker Tresp, and Hans-Peter Kriegel. 2012. Factorizing YAGO: scalable machine learning for linked data. In WWW. ACM, 271--280. https://doi.org/10.1145/2187836.2187874Google ScholarGoogle Scholar
  23. Pouya Ghiasnezhad Omran, Kewen Wang, and Zhe Wang. 2018. Scalable Rule Learning via Learning Representation. In IJCAI. ijcai.org, 2149--2155. https: //doi.org/10.24963/ijcai.2018/297Google ScholarGoogle Scholar
  24. Heiko Paulheim. 2017. Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web 8, 3 (2017), 489--508. https://doi.org/10. 3233/SW-160218Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Baoxu Shi and Tim Weninger. 2018. Open-World Knowledge Graph Completion. In AAAI. 1957--1964. https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/ view/16055Google ScholarGoogle Scholar
  26. Richard Socher, Danqi Chen, Christopher D Manning, and Andrew Ng. 2013. Reasoning with neural tensor networks for knowledge base completion. In NIPS. 926-- 934. https://papers.nips.cc/paper/5028-reasoning-with-neural-tensor-networksfor- knowledge-base-completionGoogle ScholarGoogle Scholar
  27. Zhen Wang, Jianwen Zhang, Jianlin Feng, and Zheng Chen. 2014. Knowledge Graph Embedding by Translating on Hyperplanes. In AAAI, Vol. 14. AAAI Press, 1112--1119. https://www.aaai.org/ocs/index.php/AAAI/AAAI14/paper/view/ 8531Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Zhuoyu Wei, Jun Zhao, Kang Liu, Zhenyu Qi, Zhengya Sun, and Guanhua Tian. 2015. Large-scale Knowledge Base Completion: Inferring via Grounding Network Sampling over Selected Instances. In CIKM. 1331--1340. https://doi.org/10.1145/ 2806416.2806513Google ScholarGoogle Scholar
  29. Ziqi Zhang. 2017. Effective and efficient Semantic Table Interpretation using TableMiner+. Semantic Web 8, 6 (2017), 921--957. https://doi.org/10.3233/SW- 160242Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Generating Rules to Filter Candidate Triples for their Correctness Checking by Knowledge Graph Completion Techniques

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture
      September 2019
      281 pages
      ISBN:9781450370080
      DOI:10.1145/3360901
      • General Chairs:
      • Mayank Kejriwal,
      • Pedro Szekely,
      • Program Chair:
      • Raphaël Troncy

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 September 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate55of198submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader