Computer Science and Information Systems 2021 Volume 18, Issue 3, Pages: 1101-1138
https://doi.org/10.2298/CSIS200131014N
Full text ( 1361 KB)
Cited by


Conflict resolution using relation classification: High-level data fusion in data integration

Nakhaei Zeinab (Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran), zeinab.nakhaei@srbiau.ac.ir
Ahmadi Ali (Faculty of Computer Engineering, K.N. Toosi University of Technology, Tehran, Iran), ahmadi@kntu.ac.ir
Sharifi Arash (Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran), a.sharifi@srbiau.ac.ir
Badie Kambiz (Iran Telecommunication Research Center (ITRC), Tehran, Iran), k_badie@itrc.ac.ir

The aim of conflict resolution in data integration systems is to identify the true values from among different and conflicting claims about a single entity provided by different data sources. Most data fusion methods for resolving conflicts between entities are based on two estimated parameters: the truthfulness of data and the trustworthiness of sources. The relations between entities are however an additional source of information that can be used in conflict resolution. In this article, we seek to bridge the gap between two important broad areas, relation estimation and truth discovery, and to demonstrate that there is a natural synergistic relationship between machine learning and data fusion. Specifically, we use relational machine learning methods to estimate the relations between entities, and then use these relations to estimate the true value using some fusion functions. An evaluation of the results shows that our proposed approach outperforms existing conflict resolution techniques, especially where there are few reliable sources.

Keywords: conflict resolution, data fusion, relational machine learning, relation estimation, relation classification