Abstract
Nonnegative matrix factorization (NMF) approximates a nonnegative matrix by the product of two low-rank nonnegative matrices. Since it gives semantically meaningful result that is easily interpretable in clustering applications, NMF has been widely used as a clustering method especially for document data, and as a topic modeling method.We describe several fundamental facts of NMF and introduce its optimization framework called block coordinate descent. In the context of clustering, our framework provides a flexible way to extend NMF such as the sparse NMF and the weakly-supervised NMF. The former provides succinct representations for better interpretations while the latter flexibly incorporate extra information and user feedback in NMF, which effectively works as the basis for the visual analytic topic modeling system that we present.Using real-world text data sets, we present quantitative experimental results showing the superiority of our framework from the following aspects: fast convergence, high clustering accuracy, sparse representation, consistent output, and user interactivity. In addition, we present a visual analytic system called UTOPIAN (User-driven Topic modeling based on Interactive NMF) and show several usage scenarios.Overall, our book chapter cover the broad spectrum of NMF in the context of clustering and topic modeling, from fundamental algorithmic behaviors to practical visual analytics systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The nonnegative rank of a matrix \(X \in \mathbb{R}_{+}^{m\times n}\) is the smallest number \(\hat{k}\) such that X = WH where \(W \in \mathbb{R}_{+}^{m\times \hat{k}}\) and \(H \in \mathbb{R}_{+}^{\hat{k}\times n}\).
- 2.
- 3.
- 4.
The term “weakly-supervised” can be considered similar to semi-supervised clustering settings, rather than supervised learning settings such as classification and regression problems.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
Results given by the sparseness measure based on L 1 and L 2 norms in [21] are similar in terms of comparison between the three NMF versions.
- 12.
For LDA, we used Mallet [41], a widely-accepted software library based on a Gibbs sampling method.
- 13.
References
Arora S, Ge R, Kannan R, Moitra A (2012) Computing a nonnegative matrix factorization – provably. In: Proceedings of the 44th symposium on theory of computing (STOC), pp 145–162
Arora S, Ge R, Halpern Y, Mimno D, Moitra A, Sontag D, Wu Y, Zhu M (2013). A practical algorithm for topic modeling with provable guarantees. J Mach Learn Res 28(2):280–288
Berman A, Plemmons RJ (1994) Nonnegative matrices in the mathematical sciences. SIAM, Philadelphia
Bertsekas DP (1999) Nonlinear programming, 2nd edn. Athena Scientific, Belmont
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022
Brunet J-P, Tamayo P, Golub TR, Mesirov JP (2004) Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci USA 101(12):4164–4169
Cai D, He X, Han J, Huang TS (2011) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560
Choo J, Park H (2013) Customizing computational methods for visual analytics with big data. IEEE Comput Graph Appl 33(4):22–28
Choo J, Lee C, Reddy CK, Park H (2013) UTOPIAN: user-driven topic modeling based on interactive nonnegative matrix factorization. IEEE Trans Vis Comput Graph 19(12):1992–2001
Cichocki A, Zdunek R, Phan AH, Amari S (2009) Nonnegative matrix and tensor factorizations: applications to exploratory multi-way data analysis and blind source separation. Wiley, London
Devarajan K (2008) Nonnegative matrix factorization: an analytical and interpretive tool in computational biology. PLoS Comput Biol 4(7):e1000029
Dhillon IS, Sra S (2005) Generalized nonnegative matrix approximations with Bregman divergences. In: Advances in neural information processing systems (NIPS), vol 18, pp 283–290
Ding C, He X, Simon HD (2005) On the equivalence of nonnegative matrix factorization and spectral clustering. In: Proceedings of SIAM international conference on data mining (SDM), pp 606–610
Ding C, Li T, Jordan M (2008) Nonnegative matrix factorization for combinatorial optimization: spectral clustering, graph matching, and clique finding. In: Proceedings of the 8th IEEE international conference on data mining (ICDM), pp 183–192
Ding C, T Li, Jordan MI (2010) Convex and semi-nonnegative matrix factorization. IEEE Trans Pattern Anal Mach Intell 32(1):45–55
Duda RO, Hart PE, Stork DG (2000) Pattern classification. Wiley-Interscience, London
Globerson A, Chechik G, Pereira F, Tishby N (2007) Euclidean embedding of co-occurrence data. J Mach Learn Res 8:2265–2295
Gonzales EF, Zhang Y (2005) Accelerating the Lee-Seung algorithm for non-negative matrix factorization. Technical Report TR05-02, Rice University
Grippo L, Sciandrone M (2000) On the convergence of the block nonlinear Gauss-Seidel method under convex constraints. Oper Res Lett 26:127–136
Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval (SIGIR)
Hoyer PO (2004) Non-negative matrix factorization with sparseness constraints. J Mach Learn Res 5:1457–1469
Kim H, Park H (2007) Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis. Bioinformatics 23(12):1495–1502
Kim H, Park H (2008) Nonnegative matrix factorization based on alternating non-negativity-constrained least squares and the active set method. SIAM J Matrix Anal Appl 30(2):713–730
Kim D, Sra S, Dhillon I (2007) Fast Newton-type methods for the least squares nonnegative matrix approximation problem. In: Proceedings of SIAM international conference on data mining (SDM), pp 343–354
Kim J, He Y, Park H (2014) Algorithms for nonnegative matrix and tensor factorizations: a unified view based on block coordinate descent framework. J Global Optim 58(2):285–319
Kim J, Park H (2008) Sparse nonnegative matrix factorization for clustering. Technical Report GT-CSE-08-01, Georgia Institute of Technology
Kim J, Park H (2008) Toward faster nonnegative matrix factorization: a new algorithm and comparisons. In: Proceedings of the 8th IEEE international conference on data mining (ICDM), pp 353–362
Kim J, Park H (2011) Fast nonnegative matrix factorization: An active-set-like method and comparisons. SIAM J Sci Comput 33(6):3261–3281
Kuang D, Park H (2013) Fast rank-2 nonnegative matrix factorization for hierarchical document clustering. In: Proceedings of the 19th ACM international conference on knowledge discovery and data mining (KDD), pp 739–747
Kuang D, Ding C, Park H (2012) Symmetric nonnegative matrix factorization for graph clustering. In: Proceedings of SIAM international conference on data mining (SDM), pp 106–117
Kuhn HW (1955) The Hungarian method for the assignment problem. Nav Res Logistics Q 2:83–97
Lawson CL, Hanson RJ (1974) Solving least squares problems. Prentice Hall, Englewood Cliffs
Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401:788–791
Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems (NIPS), vol 14, pp 556–562
Lewis DD, Yang Y, Rose TG, Li F (2004) Rcv1: a new benchmark collection for text categorization research. J Mach Learn Res 5:361–397
Li S, Hou XW, Zhang HJ, Cheng QS (2001) Learning spatially localized, parts-based representation. In: Proceedings of the 2001 IEEE conference on computer vision and pattern recognition (CVPR), pp 207–212
Li T, Ding C, Jordan MI (2007) Solving consensus and semi-supervised clustering problems using nonnegative matrix factorization. In: Proceedings of the 7th IEEE international conference on data mining (ICDM), pp 577–582
Li L, Lebanon G, Park H (2012) Fast Bregman divergence NMF using Taylor expansion and coordinate descent. In: Proceedings of the 18th ACM international conference on knowledge discovery and data mining (KDD), pp 307–315
Lin C-J (2007) Projected gradient methods for nonnegative matrix factorization. Neural Comput 19(10):2756–2779
Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge
McCallum AK (2002) MALLET: a machine learning for language toolkit. http://mallet.cs.umass.edu
Monti S, Tamayo P, Mesirov J, Golub T (2003) Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Mach Learn 52(1–2):91–118
Paatero P, Tapper U (1994) Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5:111–126
Pauca VP, Shahnaz F, Berry MW, Plemmons RJ (2004) Text mining using non-negative matrix factorizations. In: Proceedings of SIAM international conference on data mining (SDM), pp 452–456
Pauca VP, Piper J, Plemmons RJ (2006) Nonnegative matrix factorization for spectral data analysis. Linear Algebra Appl 416(1):29–47
Shahnaz F, Berry MW, Pauca VP, Plemmons RJ (2006) Document clustering using nonnegative matrix factorization. Inf Process Manag 42:373–386
Tibshirani R (1994) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol 58:267–288
van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605
Vavasis SA (2009) On the complexity of nonnegative matrix factorization. SIAM J Optim 20(3):1364–1377
Wild S, Curry J, Dougherty A (2004) Improving non-negative matrix factorizations through structured initialization. Pattern Recognit 37:2217–2232
Xie B, Song L, Park H (2013) Topic modeling via nonnegative matrix factorization on probability simplex. In: NIPS workshop on topic models: computation, application, and evaluation
Xu W, Liu X, Gong Y (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR), pp 267–273
Acknowledgements
The work of the authors was supported in part by the National Science Foundation (NSF) grants CCF-0808863 and the Defense Advanced Research Projects Agency (DARPA) XDATA program grant FA8750-12-2-0309. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF or the DARPA.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Kuang, D., Choo, J., Park, H. (2015). Nonnegative Matrix Factorization for Interactive Topic Modeling and Document Clustering. In: Celebi, M. (eds) Partitional Clustering Algorithms. Springer, Cham. https://doi.org/10.1007/978-3-319-09259-1_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-09259-1_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09258-4
Online ISBN: 978-3-319-09259-1
eBook Packages: EngineeringEngineering (R0)