Learning Deep Representations for Graph Clustering

Authors

  • Fei Tian University of Science and Technology of China
  • Bin Gao Microsoft Research
  • Qing Cui Tsinghua University
  • Enhong Chen University of Science and Technology of China
  • Tie-Yan Liu Microsoft Research

DOI:

https://doi.org/10.1609/aaai.v28i1.8916

Keywords:

deep representations, clustering on graph, neural networks

Abstract

Recently deep learning has been successfully adopted in many applications such as speech recognition and image classification. In this work, we explore the possibility of employing deep learning in graph clustering. We propose a simple method, which first learns a nonlinear embedding of the original graph by stacked autoencoder, and then runs $k$-means algorithm on the embedding to obtain the clustering result. We show that this simple method has solid theoretical foundation, due to the similarity between autoencoder and spectral clustering in terms of what they actually optimize. Then, we demonstrate that the proposed method is more efficient and flexible than spectral clustering. First, the computational complexity of autoencoder is much lower than spectral clustering: the former can be linear to the number of nodes in a sparse graph while the latter is super quadratic due to eigenvalue decomposition. Second, when additional sparsity constraint is imposed, we can simply employ the sparse autoencoder developed in the literature of deep learning; however, it is non-straightforward to implement a sparse spectral method. The experimental results on various graph datasets show that the proposed method significantly outperforms conventional spectral clustering which clearly indicates the effectiveness of deep learning in graph clustering.

Downloads

Published

2014-06-21

How to Cite

Tian, F., Gao, B., Cui, Q., Chen, E., & Liu, T.-Y. (2014). Learning Deep Representations for Graph Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1). https://doi.org/10.1609/aaai.v28i1.8916

Issue

Section

Main Track: Machine Learning Applications