Entity Resolution Using Convolutional Neural Network

https://doi.org/10.1016/j.procs.2016.09.306Get rights and content
Under a Creative Commons license
open access

Abstract

Entity resolution is an important application in field of data cleaning. Standard approaches like deterministic methods and probabilistic methods are generally used for this purpose. Many new approaches using single layer perceptron, crowdsourcing etc. are developed to improve the efficiency and also to reduce the time of entity resolution. The approaches used for this purpose also depend on the type of dataset, labeled or unlabeled. This paper presents a new method for labeled data which uses single layered convolutional neural network to perform entity resolution. It also describes how crowdsourcing can be used with the output of the convolutional neural network to further improve the accuracy of the approach while minimizing the cost of crowdsourcing. The paper also discusses the data pre-processing steps used for training the convolutional neural network. Finally it describes the airplane sensor dataset which is used for demonstration of this approach and then shows the experimental results achieved using convolutional neural network.

Keywords

word stemming
word embedding
convolutional neural network
crowdsourcing
hybrid machine-human model

Cited by (0)

Peer-review under responsibility of scientific committee of Missouri University of Science and Technology.