research-article

Generating Rules to Filter Candidate Triples for their Correctness Checking by Knowledge Graph Completion Techniques

Authors:
Agustín Borrego, University of Seville, Seville, Spain
Daniel Ayala, University of Seville, Seville, Spain
Inma Hernández, University of Seville, Seville, Spain
Carlos R. Rivero, Rochester Institute of Technology, Rochester, NY, USA
David Ruiz, University of Seville, Seville, Spain

K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture, September 2019, Pages 115–122, https://doi.org/10.1145/3360901.3364418

Published: 23 September 2019


ABSTRACT

Knowledge Graphs (KGs) contain large amounts of structured information. Because they are inherently incomplete, a process known as KG completion is often carried out to find the missing triples in a KG, usually by training a fact-checking model that can discern between correct and incorrect knowledge. Once the fact-checking model has been trained and evaluated, it is applied to a set of candidate triples, and those considered correct are added to the KG as new knowledge. This process therefore needs a reasonably sized set of candidate triples that represent possible new knowledge. Current approaches for selecting candidate triples for correctness checking either use the full set of possible missing candidate triples (and thus provide no filtering) or apply very basic rules to filter out unlikely candidates. Because very few candidate triples are filtered out, this may have a negative effect on completion performance. In this paper we present CHAI, a method for producing more complex rules that filter candidate triples by combining a set of criteria to optimize a fitness function. Our experiments show that CHAI generates rules that, when applied, yield smaller candidate sets than similar proposals while still including promising candidate triples.
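To make the candidate-selection problem described in the abstract concrete, the following is a minimal sketch on a toy KG. It is not CHAI and does not reproduce the paper's rules or fitness function. The entities, relations, and the `seen_domain_range_rule` helper are all hypothetical, chosen only to illustrate the kind of very basic filter the abstract contrasts with, and how much a filter can shrink the full candidate set.

```python
from itertools import product

# Toy knowledge graph as a set of (subject, predicate, object) triples.
# Every entity and relation name here is invented for illustration.
kg = {
    ("alice", "born_in", "seville"),
    ("bob", "born_in", "rochester"),
    ("seville", "located_in", "spain"),
    ("rochester", "located_in", "usa"),
    ("alice", "works_at", "us"),
}

entities = sorted({e for s, _, o in kg for e in (s, o)})
relations = sorted({p for _, p, _ in kg})


def all_candidates(kg, entities, relations):
    """Unfiltered candidate set: every possible triple not already in the KG."""
    return {
        (s, p, o)
        for s, p, o in product(entities, relations, entities)
        if s != o and (s, p, o) not in kg
    }


def seen_domain_range_rule(kg):
    """Hypothetical basic rule: accept (s, p, o) only if s already occurs
    as a subject of p and o already occurs as an object of p."""
    subjects, objects = {}, {}
    for s, p, o in kg:
        subjects.setdefault(p, set()).add(s)
        objects.setdefault(p, set()).add(o)

    def rule(triple):
        s, p, o = triple
        return s in subjects.get(p, set()) and o in objects.get(p, set())

    return rule


def filter_candidates(candidates, rules):
    """Keep only the candidates accepted by every rule."""
    return {t for t in candidates if all(rule(t) for rule in rules)}


full = all_candidates(kg, entities, relations)
filtered = filter_candidates(full, [seen_domain_range_rule(kg)])

print(f"unfiltered candidates: {len(full)}")
print(f"after filtering:       {len(filtered)}")
```

Even on this tiny graph, the unfiltered set is far larger than the KG itself, which is why the abstract argues for a filtering step before the fact-checking model is applied. A rule this simple can also discard true triples, which is the trade-off the abstract's fitness function is meant to balance.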


Index Terms

Information systems → Information systems applications → Data mining

Published in
K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture
September 2019
281 pages
ISBN: 9781450370080
DOI: 10.1145/3360901
General Chairs: Mayank Kejriwal (University of Southern California Information Sciences Institute, USA), Pedro Szekely (University of Southern California Information Sciences Institute, USA)
Program Chair: Raphaël Troncy (EURECOM, France)
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
candidate filtering
knowledge graph completion
knowledge graphs
Qualifiers
- research-article
Acceptance Rates
Overall Acceptance Rate: 55 of 198 submissions, 28%