ABSTRACT
The growing popularity of social media (e.g., Twitter) allows users to easily share information with each other and influence others by expressing their own sentiments on various subjects. In this work, we propose an unsupervised tri-clustering framework, which analyzes both user-level and tweet-level sentiments through co-clustering of a tripartite graph. A compelling feature of the proposed framework is that the quality of sentiment clustering of tweets, users, and features can be mutually improved by joint clustering. We further investigate the evolution of user-level sentiments and latent feature vectors in an online framework and devise an efficient online algorithm to sequentially update the clustering of tweets, users and features with newly arrived data. The online framework not only provides better quality of both dynamic user-level and tweet-level sentiment analysis, but also improves the computational and storage efficiency. We verified the effectiveness and efficiency of the proposed approaches on the November 2012 California ballot Twitter data.
- S. Arora, R. Ge, R. Kannan, and A. Moitra. Computing a nonnegative matrix factorization -- provably. In STOC Conference, pages 145--162. ACM, 2012. Google ScholarDigital Library
- L. Barbosa and J. Feng. Robust sentiment detection on twitter from biased and noisy data. In COLING Conference, pages 36--44. Association for Computational Linguistics, 2010. Google ScholarDigital Library
- D. Cai, X. He, J. Han, and T. S. Huang. Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell., 33(8):1548--1560, 2011. Google ScholarDigital Library
- B. Cao, D. Shen, J.-T. Sun, X. Wang, Q. Yang, and Z. Chen. Detect and track latent factors with online nonnegative matrix factorization. In IJCAI Conference, pages 2689--2694. Morgan Kaufmann Publishers Inc., 2007. Google ScholarDigital Library
- M. Castellanos, U. Dayal, M. Hsu, R. Ghosh, M. Dekhil, Y. Lu, L. Zhang, and M. Schreiman. Lci: a social channel analysis platform for live customer intelligence. In SIGMOD Conference, pages 1049--1058, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- D. Davidov, O. Tsur, and A. Rappoport. Enhanced sentiment learning using twitter hashtags and smileys. In COLING Conference, pages 241--249, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics. Google ScholarDigital Library
- H. Deng, J. Han, H. Ji, H. Li, Y. Lu, and H. Wang. Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networks. In SDM, pages 378--386. SIAM, 2013.Google Scholar
- K. Devarajan. Nonnegative matrix factorization: an analytical and interpretive tool in computational biology. PLoS Comput Biol, 4(7):e1000029, 2008.Google ScholarCross Ref
- C. Ding, T. Li, W. Peng, and H. Park. Orthogonal nonnegative matrix t-factorizations for clustering. In SIGKDD Conference, pages 126--135. ACM, 2006. Google ScholarDigital Library
- B. Gao, T.-Y. Liu, X. Zheng, Q.-S. Cheng, and W.-Y. Ma. Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering. In SIGKDD Conference, pages 41--50. ACM, 2005. Google ScholarDigital Library
- A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. Technical report, pages 1--6, 2009.Google Scholar
- A. B. Goldberg and X. Zhu. Seeing stars when there aren't many stars: Graph-based semi-supervised learning for sentiment categorization. In TextGraphs WorkShop, pages 45--52. Association for Computational Linguistics, 2006. Google ScholarDigital Library
- Q. Gu and J. Zhou. Co-clustering on manifolds. In SIGKDD Conference, pages 359--368. ACM, 2009. Google ScholarDigital Library
- V. Hatzivassiloglou and K. R. McKeown. Predicting the semantic orientation of adjectives. In COLING/EACL Conference, pages 174--181, Morristown, NJ, USA, 1997. Association for Computational Linguistics. Google ScholarDigital Library
- X. Hu, J. Tang, H. Gao, and H. Liu. Unsupervised sentiment analysis with emotional signals. In World Wide Web Conference. ACM, 2013. Google ScholarDigital Library
- J. Kim and H. Park. Fast nonnegative matrix factorization: An active-set-like method and comparisons. SIAM J. Sci. Comput., 33(6):3261--3281, 2011. Google ScholarDigital Library
- J. Kim, J. Yoo, H. Lim, H. Qiu, Z. Kozareva, and A. Galstyan. Sentiment prediction using collaborative filtering. In ICWSM'13, 2013.Google Scholar
- H. W. Kuhn and A. W. Tucker. Nonlinear programming. In Proceedings of the 2nd Berkeley Symposium on Mathematical Statistics and Probability, pages 481--492. University of California Press, Berkeley, CA, USA, 1950.Google Scholar
- D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In NIPS Conference, pages 556--562. MIT Press, 2000.Google Scholar
- C.-J. Lin. Projected gradient methods for nonnegative matrix factorization. Neural Comput., 19(10):2756--2779, 2007. Google ScholarDigital Library
- J. Lin and A. Kolcz. Large-scale machine learning at twitter. In SIGMOD Conference, pages 793--804. ACM, 2012. Google ScholarDigital Library
- M. Long, J. Wang, G. Ding, D. Shen, and Q. Yang. Transfer learning with graph co-regularization. In AAAI Conference, 2012.Google Scholar
- P. Melville, W. Gryc, and R. D. Lawrence. Sentiment analysis of blogs by combining lexical knowledge with text classification. In SIGKDD Conference, pages 1275--1284, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
- L. T. Nguyen, P. Wu, W. Chan, W. Peng, and Y. Zhang. Predicting collective sentiment dynamics from time-series social media. In WSDM Workshop, pages 6:1--6:8. ACM, 2012. Google ScholarDigital Library
- B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up : sentiment classification using machine learning techniques. In EMNLP Conference, pages 79--86, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics. Google ScholarDigital Library
- A. Saha and V. Sindhwani. Learning evolving and emerging topics in social media: a dynamic nmf approach with temporal regularization. In WSDM Conference, pages 693--702. ACM, 2012. Google ScholarDigital Library
- L. Smith, L. Zhu, K. Lerman, and Z. Kozareva. The role of social media in the discussion of controversial topics. In SocialCom/PASSAT Conference, 2013. Google ScholarDigital Library
- M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Workshop on Unsupervised Learning in NLP, pages 53--63. Association for Computational Linguistics, 2011. Google ScholarDigital Library
- C. Tan, L. Lee, J. Tang, L. Jiang, M. Zhou, and P. Li. User-level sentiment analysis incorporating social networks. In SIGKDD Conference, pages 1397--1405, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- F. Wang, P. Li, and A. C. Knig. Efficient document clustering via online nonnegative matrix factorizations. In SDM Conference, pages 908--919, 2011.Google ScholarCross Ref
- X. Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang. Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In CIKM Conference, pages 1031--1040, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In HLT/EMNLP Conference, pages 347--354. Association for Computational Linguistics, 2005. Google ScholarDigital Library
- Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. A model-based approach to attributed graph clustering. In SIGMOD Conference, pages 505--516, 2012. Google ScholarDigital Library
- Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. Gbagc: A general bayesian framework for attributed graph clustering. To appear in TKDD, 2014.Google Scholar
- L. Zhu, A. Galstyan, J. Cheng, and K. Lerman. Tripartite graph clustering for dynamic sentiment analysis on social media. Technical Report (arXiv:1402.6010v2 {cs.SI}), 2014. Google ScholarDigital Library
- L. Zhu, S. Gao, S. J. Pan, H. Li, D. Deng, and C. Shahabi. Graph-based informative-sentence selection for opinion summarization. In ASONAM Conference, pages 408--412, 2013. Google ScholarDigital Library
- F. Zhuang, P. Luo, H. Xiong, Q. He, Y. Xiong, and Z. Shi. Exploiting associations between word clusters and document classes for cross-domain text categorization. Stat. Anal. Data Min., 4(1):100--114, 2011. Google ScholarDigital Library
Index Terms
- Tripartite graph clustering for dynamic sentiment analysis on social media
Recommendations
Detecting bursts in sentiment-aware topics from social media
Nowadays plenty of user-generated posts, e.g., sina weibos, are published on the social media. The posts contain the publics sentiments (i.e., positive or negative) towards various topics. Bursty sentiment-aware topics from these posts reveal sentiment-...
Social sentiment sensor: a visualization system for topic detection and topic sentiment analysis on microblog
As a new form of social media, microblogging provides platform sharing, wherein users can share their feelings and ideas on certain topics. Bursty topics from microblogs are the results of the emerging issues that instantly attract more followers and ...
Contextual sentiment analysis for social media genres
The lexicon-based approaches to opinion mining involve the extraction of term polarities from sentiment lexicons and the aggregation of such scores to predict the overall sentiment of a piece of text. It is typically preferred where sentiment labelled ...
Comments