skip to main content
10.1145/3638584.3638602acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaiConference Proceedingsconference-collections
research-article

User entity alignment method based on cross-attribute knowledge association

Published:14 March 2024Publication History

ABSTRACT

User entity alignment is the core technology of associating multisource user identities and constructing user portraits, which is of great significance in cyberspace security, personalized service recommendation, social network, data mining, and other fields. It is difficult to accurately align user entities based on common attributes when the common attributes of multisource user data are sparse. Aiming at the above problem, we propose a user entity alignment method based on cross-attribute knowledge association. Firstly, the attribute values in the user information are linked to the corresponding entities in a knowledge graph, and the representation vector of each attribute value is obtained by embedding the subgraph of the knowledge graph. With the help of knowledge graph, the knowledge association between attribute values is embedded into the attribute vectors. At the same time, to accurately measure the attribute weight, the attribute identification degree is calculated by the distribution of attribute values. Finally, the user representation vector is generated by weighted cumulative attribute value vectors, and the similarity between user vectors is calculated to judge whether two users refer to the same person entity. Experimental results demonstrate that, the accuracy, recall, and F1 score of the proposed method are not less than 0.87 on the person entity dataset with sparse attributes. Compared with existing typical methods based on common attributes and methods based on knowledge graph embedding, the accuracy, recall, and F1 score are 12%, 7% and 10% higher than the comparative algorithm respectively.

References

  1. Zafarani R, Liu H. "Connecting users across social media sites: a behavioral-modeling approach," Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Chicago: ACM Press, 2013: 41-49.Google ScholarGoogle Scholar
  2. Zhang Y, Tang J, Yang Z "Cosnet: connecting heterogeneous social networks with local and global consistency," Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Sydney: ACM Press, 2015: 1485-1494.Google ScholarGoogle Scholar
  3. Korula N, Lattanzi S. "An efficient reconciliation algorithm for social networks," Proceedings of the VLDB Endowment, 2014, 7(5): 377-388.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Lu C T, Shuai H H, Yu P S. "Identifying your customers in social networks," Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. Shanghai: ACM Press, 2014:391-400.Google ScholarGoogle Scholar
  5. Zafarani R, Liu H. "Connecting Corresponding Identities across Communities," ICWSM. 2009, 9: 354–357.Google ScholarGoogle ScholarCross RefCross Ref
  6. Iofciu T, Fankhauser P, Abel F "Identifying Users Across Social Tagging Systems," In International Conference on Weblogs and Social Media, Barcelona, Catalonia, Spain, July. 2010Google ScholarGoogle Scholar
  7. Ikeda M, Ono S, Sato I "Person Name Disambiguation on the Web by Two-Stage Clustering," In 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference.Google ScholarGoogle Scholar
  8. Motoyama M, Varghese G. "I seek you: searching and matching individuals in social networks," In Eleventh International Workshop on Web Information and Data Management. 2009: 67–75.Google ScholarGoogle Scholar
  9. Jain P, Kumaraguru P, Joshi A. "identifying users across multiple online social networks," 2013: 1259–1268.Google ScholarGoogle Scholar
  10. Raad E, Chbeir R, Dipanda A. "User Profile Matching in Social Networks," In International Conference on Network-Based Information Systems. 2010: 297–304.Google ScholarGoogle Scholar
  11. Ma J, Qiao Y, Hu G "Balancing User Profile and Social Network Structure for Anchor Link Inferring across Multiple Online Social Networks," IEEE Access. 2017, PP (99): 1-1.Google ScholarGoogle Scholar
  12. Wei Y C, Lin M S et al. "Name Disambiguation in Person Information Mining," IEEE/WIC/ACM International Conference on Web, 2008:378-381Google ScholarGoogle Scholar
  13. Wang X, Sun A, Kardes H, "Probabilistic estimates of attribute statistics and match likelihood for people entity resolution," IEEE International Conference on Big Data. IEEE, 2015.Google ScholarGoogle Scholar
  14. Guan S, Jin X, Jia Y "Self-learning and embedding based entity alignment," IEEE International Conference on Big Knowledge. 2017.Google ScholarGoogle Scholar
  15. Qiu Q, Luo J, Yin M. "Person Name Disambiguation by distinguishing the Importance of features based on Topological Distance," Current Trends in Computer Science and Mechanical Automation Vol.1. De Gruyter Open Poland, 2018, pp. 342-351.Google ScholarGoogle Scholar
  16. Zhang Y, Wang L, Li X "Social Identity Link Across Incomplete Social In-formation Sources Using Anchor Link Expansion," In PAKDD 2016. 2016:395–408.Google ScholarGoogle Scholar
  17. Goga O, Loiseau P, Sommer R "On the Reliability of Profile Matching Across Large Online Social Networks," In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015: 1799–1808.Google ScholarGoogle Scholar
  18. Zhang Y, Tang J, Yang Z "Cosnet: connecting heterogeneous social networks with local and global consistency," In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015: 1485–1494.Google ScholarGoogle Scholar
  19. Zhang Q, Sun Z, Hu W, "Multi-view Knowledge Graph Embedding for Entity Alignment," arXiv preprint arXiv:1906.02390, 2019.Google ScholarGoogle Scholar
  20. Suchanek F M, Abiteboul S, Senellart P. "PARIS: probabilistic alignment of relations, instances, and schema," Proceedings of the Vldb Endowment, 2011, 5(3):157-168.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Lee S, Hwang S. "ARIA: asymmetry resistant instance alignment," Twenty-Eighth AAAI Conference on Artificial Intelligence. 2014.Google ScholarGoogle Scholar
  22. Zhuang Y, Li G L, Feng J H. "Review of Knowledge base entity alignment techniques," Journal of Computer Research and Development, 2016, a53(1):165-192.Google ScholarGoogle Scholar
  23. Fan F, Li Z, Wang Y. "Cohesion based attribute value matching," International Congress on Image & Signal Processing. 2018.Google ScholarGoogle Scholar
  24. Wang X, Sun A, Kardes H "Probabilistic estimates of attribute statistics and match likelihood for people entity resolution," IEEE International Conference on Big Data. IEEE, 2015Google ScholarGoogle Scholar
  25. Li J, Wang Z, Zhang X "Large scale instance matching via multiple indexes and candidate selection," Knowledge-Based Systems, 2013, 50(Complete):112-120.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Wick M, Singh S, Mccallum A. "A discriminative hierarchical model for fast coreference at large scale," Proc Acl, 2012:379-388Google ScholarGoogle Scholar
  27. Guan S, Jin X, Jia Y "Self-learning and embedding based entity alignment," IEEE International Conference on Big Knowledge. 2017.Google ScholarGoogle Scholar
  28. Wu G, Ying H, Hu X. "Entity linking: an issue to extract corresponding entity with knowledge base," IEEE Access, 2018, 6(99):1-1.Google ScholarGoogle Scholar
  29. McNeill N, Kardes H, Borthwick A. "Dynamic record blocking: efficient linking of massive databases in mapreduce," Proceedings of the 10th International Workshop on Quality in Databases (QDB). 2012.Google ScholarGoogle Scholar
  30. Xu C "Research on Markov logic networks," Journal of software, 2011, 22 (8): 1699-1713Google ScholarGoogle ScholarCross RefCross Ref
  31. Wang X, Liu K, He S "Multi-Source Knowledge Bases Entity Alignment by Leveraging Semantic Tags," Journal of computer science, 2017 (03): 169-179Google ScholarGoogle Scholar
  32. Gao S, Xing Z, Ma Y "Enhancing knowledge sharing in stack overflow via automatic external web resources linking," International Conference on Engineering of Complex Computer Systems. 2018.Google ScholarGoogle Scholar
  33. Lacoste-Julien S, Palla K, Davies A "Sigma: Simple greedy matching for aligning large knowledge bases," Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2013: 572-580.Google ScholarGoogle Scholar
  34. Christen P. "Development and user experiences of an open source data cleaning, deduplication and record linkage system," ACM SIGKDD Explorations Newsletter, 2009, 11(1): 39-48Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Zhao D, Ning Z, Ming X "An Improved User Identification Method Across Social Networks Via Tagging Behaviors," 2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, 2018.Google ScholarGoogle Scholar
  36. Tang R, Miao Z, Jiang S "Interlayer link prediction in multiplex social networks based on multiple types of consistency between embedding vectors," IEEE Transactions on Cybernetics, 2021.Google ScholarGoogle Scholar
  37. Majeed A, Lee S. "Anonymization techniques for privacy preserving data publishing: A comprehensive survey," IEEE access, 2020, 9: 8512-8545.Google ScholarGoogle ScholarCross RefCross Ref
  38. Tang R, Jiang S, Chen X "Interlayer link prediction in multiplex social networks: an iterative degree penalty algorithm," Knowledge-Based Systems, 2020, 194: 105598.Google ScholarGoogle ScholarCross RefCross Ref
  39. Milne D, Witten I H. "Learning to link with wikipedia,"Proceedings of the 17th ACM conference on Information and knowledge management. ACM, 2008: 509-518.Google ScholarGoogle Scholar
  40. Shen W, Wang J, Han J. "Entity linking with a knowledge base: issues, techniques, and solutions," IEEE Transactions on Knowledge and Data Engineering, 2015, 27(2):443-460.Google ScholarGoogle ScholarCross RefCross Ref
  41. Zhang C, Miao Z, Xiao H "Knowledge graph embedding for hyper-relational data," Tsinghua Science & Technology, 2017, 22(2):185-197.Google ScholarGoogle ScholarCross RefCross Ref
  42. Bordes A, Usunier N, Garcia-Duran A "Translating embeddings for modeling multi-relational data," Advances in neural information processing systems. 2013: 2787-2795.Google ScholarGoogle Scholar

Index Terms

  1. User entity alignment method based on cross-attribute knowledge association

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CSAI '23: Proceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence
      December 2023
      563 pages
      ISBN:9798400708688
      DOI:10.1145/3638584

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 March 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format