Skip to main content
Log in

HetGNN-SF: Self-supervised learning on heterogeneous graph neural network via semantic strength and feature similarity

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Heterogeneous graph neural networks (HGNNs) can effectively model multiple node types and complex interactions in real networks and solve problems in various practical applications. Self-supervised Learning-based HGNNs (SL-HGNNs) have become the current research focus in this field because they can solve the problem of difficult label acquisition in practical scenarios. These methods usually split heterogeneous graphs into multiple subgraphs based on meta-paths for separate study. However, they ignore the complex interactions between the different semantics of the graphs. In addition, they use node features as auxiliary information for heterogeneous graph representation learning and ignore the importance of features. To solve the above problems, we propose a Self-supervised Learning on Heterogeneous Graph Neural Network via Semantic strength and Feature similarity (HetGNN-SF) model. This model innovatively implements a Feature- and Topology-based Comparative optimization (FTC) method to generate weights for different meta-paths and then splits the original heterogeneous graph into different semantic subgraphs based on meta-paths. Thereafter, the different subgraphs are fused by the FTC to generate semantic fusion graphs that capture the interactions between different semantics. Semantic strength and feature similarity perspectives generate node embeddings from the semantic fusion graphs. Finally, the FTC positive and negative samples are used for contrastive learning from the two perspectives to yield the final node embeddings. Extensive experiments are conducted on three real datasets using the proposed HetGNN-SF model; the results reveal that HetGNN-SF outperforms state-of-the-art models. Our data and code are available on GitHub (https://github.com/LiuXMaa/HetGNN-SF-.git).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data Availability

The datasets used in the experiments are publicly available in the online repository.

Notes

  1. https://github.com/Andy-Border/NSHE

  2. https://github.com/Andy-Border/NSHE

  3. https://github.com/cynricfu/MAGNN

References

  1. Stankova M, Praet S, Martens D, Provost F (2021) Node classification over bipartite graphs through projection. Mach Learn 110(1):37–87

    Article  MathSciNet  MATH  Google Scholar 

  2. Wu M, Pan S, Zhu X (2021) Openwgl: open-world graph learning for unseen class node classification. Knowl Inf Syst 63(9):2405–2430

    Article  Google Scholar 

  3. Fan S, Zhu J, Han X, Shi C, Hu L, Ma B, Li Y (2019) Metapath-guided heterogeneous graph neural network for intent recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining pp. 2478–2486

  4. Liu T, He Z (2022) Dlir: a deep learning-based initialization recommendation algorithm for trust-aware recommendation. Appl Intell 1–12

  5. Lin R, Tang Y, Yuan C, He C, Li W (2022) Scholat link prediction: A link prediction dataset fusing topology and attribute information. In: CCF Conference on Computer Supported Cooperative Work and Social Computing pp. 340–351. Springer

  6. Li X, Shang Y, Cao Y, Li Y, Tan J, Liu Y (2020) Type-aware anchor link prediction across heterogeneous networks based on graph attention network. In: Proceedings of the AAAI Conference on Artificial Intelligence vol. 34, pp. 147–155

  7. Shi C, Li Y, Zhang J, Sun Y, Philip SY (2016) A survey of heterogeneous information network analysis. IEEE Trans Knowl Data Eng 29(1):17–37

    Article  Google Scholar 

  8. Schlichtkrull M, Kipf TN, Bloem P, Berg R, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks. In: European Semantic Web Conference vol. 10843, pp. 593–607. Springer

  9. Wang R, Shi C, Zhao T, Wang X, Ye YF (2021) Heterogeneous information network embedding with adversarial disentangler. IEEE Trans Knowl Data Eng

  10. Park C, Kim D, Han J, Yu H (2020) Unsupervised attributed multiplex network embedding. In: Proceedings of the AAAI Conference on Artificial Intelligence vol. 34, pp. 5371–5378

  11. Wang X, Liu N, Han H, Shi C (2021) Self-supervised heterogeneous graph neural network with co-contrastive learning. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining pp. 1726–1736

  12. Lin Z, Tian C, Hou Y, Zhao WX (2022) Improving graph collaborative filtering with neighborhood-enriched contrastive learning. In: Proceedings of the ACM Web Conference 2022 pp. 2320–2329

  13. Wang X, Ji H, Shi C, Wang B, Ye Y, Cui P, Yu PS (2019) Heterogeneous graph attention network. In: The World Wide Web Conference pp. 2022–2032

  14. Jiang X, Jia T, Fang Y, Shi C, Lin Z, Wang H (2021) Pre-training on large-scale heterogeneous graph. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining pp. 756–766

  15. Wang C, Zhou S, Yu K, Chen D, Li B, Feng Y, Chen C (2022) Collaborative knowledge distillation for heterogeneous information network embedding. In: Proceedings of the ACM Web Conference 2022 pp. 1631–1639

  16. Wang Y, Zhang Z, Liu T, Xu H, Wang J, Guo L (2020) Fine-grained semantics-aware heterogeneous graph neural networks. In: International Conference on Web Information Systems Engineering pp. 71–82. Springer

  17. Jin M, Zheng Y, Li Y, Gong C, Zhou C, Pan S (2021) Multi-scale contrastive siamese networks for self-supervised graph representation learning. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021 pp. 1477–1483

  18. Welling M, Kipf TN (2017) Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th International Conference on Learning Representations, ICLR

  19. Velickovic P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y (2018) Graph attention networks. In: Proceedings of the 6th International Conference on Learning Representations, ICLR

  20. Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inf Process Syst 30:1024–1034

    Google Scholar 

  21. Wang X, Zhu M, Bo D, Cui P, Shi C, Pei J (2020) Am-gcn: adaptive multi-channel graph convolutional networks. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining pp. 1243–1253

  22. Sun Y, Deng H, Yang Y, Wang C, Xu J, Huang R, Cao L, Wang Y, Chen L (2022) Beyond homophily: structure-aware path aggregation graph neural network. In: Raedt, L.D. (ed.) Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022 pp. 2233–2240

  23. Ji H, Wang X, Shi C, Wang B, Yu P (2021) Heterogeneous graph propagation network. IEEE Trans Knowl Data Eng 1–12

  24. Fu X, Zhang J, Meng Z, King I (2020) Magnn: Metapath aggregated graph neural network for heterogeneous graph embedding. In: Proceedings of The Web Conference 2020 pp. 2331–2341

  25. Yun S, Jeong M, Kim R, Kang J, Kim HJ (2019) Graph transformer networks. Adv Neural Inf Process Syst 32:11960–11970

    Google Scholar 

  26. Zhang C, Song D, Huang C, Swami A, Chawla NV (2019) Heterogeneous graph neural network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 793–803

  27. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  28. Hu Z, Dong Y, Wang K, Sun Y (2020) Heterogeneous graph transformer. In: Proceedings of The Web Conference 2020 pp. 2704–2710

  29. Lv Q, Ding M, Liu Q, Chen Y, Feng W, He S, Zhou C, Jiang J, Dong Y, Tang J (2021) Are we really making much progress? revisiting, benchmarking and refining heterogeneous graph neural networks. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining pp. 1150–1160

  30. Zhao J, Wang X, Shi C, Hu B, Song G, Ye Y (2021) Heterogeneous graph structure learning for graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence vol. 35, pp. 4697–4705

  31. Yang Y, Guan Z, Li J, Zhao W, Cui J, Wang Q (2021) Interpretable and efficient heterogeneous graph convolutional network. IEEE Trans Knowl Data Eng 1–14

  32. Zhang M, Wang X, Zhu M, Shi C, Zhang Z, Zhou J (2022) Robust heterogeneous graph neural networks against adversarial attacks. In: Proceedings of the AAAI Conference on Artificial Intelligence vol. 36, pp. 4363–4370

  33. Liu X, Zhang F, Hou Z, Mian L, Wang Z, Zhang J, Tang J (2021) Self-supervised learning: generative or contrastive. IEEE Trans Knowl Data Eng 35(1):857–876

    Google Scholar 

  34. Velickovic P, Fedus W, Hamilton WL, Liò P, Bengio Y, Hjelm RD (2019) Deep graph infomax. ICLR (Poster) 2(3):4

    Google Scholar 

  35. Ren Y, Liu B, Huang C, Dai P, Bo L, Zhang J (2019) Heterogeneous deep graph infomax. arXiv:1911.08538

  36. Zhao J, Wang X, Shi C, Liu Z, Ye Y (2020) Network schema preserving heterogeneous information network embedding. In: International Joint Conference on Artificial Intelligence (IJCAI) pp. 1366–1372

  37. Jiang X, Lu Y, Fang Y, Shi C (2021) Contrastive pre-training of gnns on heterogeneous graphs. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management pp. 803–812

  38. He D, Liang C, Huo C, Feng Z, Jin D, Yang L, Zhang W (2022) Analyzing heterogeneous networks with missing attributes by unsupervised contrastive learning. IEEE Trans Neural Netw Learn Syst

  39. Zhu Y, Xu Y, Cui H, Yang C, Liu Q, Wu S (2022) Structure-enhanced heterogeneous graph contrastive learning. In: Proceedings of the 2022 SIAM International Conference on Data Mining (SDM) pp. 82–90. SIAM

  40. Dong Y, Chawla NV, Swami A (2017) metapath2vec: scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining pp. 135–144

  41. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining pp. 701–710

  42. Hussein R, Yang D, Cudré-Mauroux P (2018) Are meta-paths necessary? revisiting heterogeneous graph embeddings. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management pp. 437–446

  43. He D, Zhai L, Li Z, Di Jin 0001, Yang L, Huang Y, Philip SY (2020) Adversarial mutual information learning for network embedding. In: IJCAI pp. 3321–3327

  44. Van der Maaten L, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(11)

Download references

Acknowledgements

This research is supported by the National Natural Science Foundation of China (Grant No.62072288, 61702306), the Taishan Scholar Program of Shandong Province, the Natural Science Foundation of Shandong Province (Grant No.ZR2022MF268).

Funding

This study was funded by the National Natural Science Foundation of China (Grant No.62072288, 61702306), the Taishan Scholar Program of Shandong Province, the Natural Science Foundation of Shandong Province (Grant No.ZR2022MF268).

Author information

Authors and Affiliations

Authors

Contributions

Chao Li, Xinming Liu, Zhongying Zhao wrote the main manuscript text; Xinming Liu and Yeyu Yan prepared the result of our experiments; All authors reviewed the manuscript.

Corresponding author

Correspondence to Zhongying Zhao.

Ethics declarations

Consent to participate

There is the consent of all authors.

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, C., Liu, X., Yan, Y. et al. HetGNN-SF: Self-supervised learning on heterogeneous graph neural network via semantic strength and feature similarity. Appl Intell 53, 21902–21919 (2023). https://doi.org/10.1007/s10489-023-04612-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-023-04612-6

Keywords

Navigation