skip to main content
10.1145/2588555.2593682acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Tripartite graph clustering for dynamic sentiment analysis on social media

Authors Info & Claims
Published:18 June 2014Publication History

ABSTRACT

The growing popularity of social media (e.g., Twitter) allows users to easily share information with each other and influence others by expressing their own sentiments on various subjects. In this work, we propose an unsupervised tri-clustering framework, which analyzes both user-level and tweet-level sentiments through co-clustering of a tripartite graph. A compelling feature of the proposed framework is that the quality of sentiment clustering of tweets, users, and features can be mutually improved by joint clustering. We further investigate the evolution of user-level sentiments and latent feature vectors in an online framework and devise an efficient online algorithm to sequentially update the clustering of tweets, users and features with newly arrived data. The online framework not only provides better quality of both dynamic user-level and tweet-level sentiment analysis, but also improves the computational and storage efficiency. We verified the effectiveness and efficiency of the proposed approaches on the November 2012 California ballot Twitter data.

References

  1. S. Arora, R. Ge, R. Kannan, and A. Moitra. Computing a nonnegative matrix factorization -- provably. In STOC Conference, pages 145--162. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Barbosa and J. Feng. Robust sentiment detection on twitter from biased and noisy data. In COLING Conference, pages 36--44. Association for Computational Linguistics, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Cai, X. He, J. Han, and T. S. Huang. Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell., 33(8):1548--1560, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. B. Cao, D. Shen, J.-T. Sun, X. Wang, Q. Yang, and Z. Chen. Detect and track latent factors with online nonnegative matrix factorization. In IJCAI Conference, pages 2689--2694. Morgan Kaufmann Publishers Inc., 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Castellanos, U. Dayal, M. Hsu, R. Ghosh, M. Dekhil, Y. Lu, L. Zhang, and M. Schreiman. Lci: a social channel analysis platform for live customer intelligence. In SIGMOD Conference, pages 1049--1058, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Davidov, O. Tsur, and A. Rappoport. Enhanced sentiment learning using twitter hashtags and smileys. In COLING Conference, pages 241--249, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Deng, J. Han, H. Ji, H. Li, Y. Lu, and H. Wang. Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networks. In SDM, pages 378--386. SIAM, 2013.Google ScholarGoogle Scholar
  8. K. Devarajan. Nonnegative matrix factorization: an analytical and interpretive tool in computational biology. PLoS Comput Biol, 4(7):e1000029, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  9. C. Ding, T. Li, W. Peng, and H. Park. Orthogonal nonnegative matrix t-factorizations for clustering. In SIGKDD Conference, pages 126--135. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Gao, T.-Y. Liu, X. Zheng, Q.-S. Cheng, and W.-Y. Ma. Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering. In SIGKDD Conference, pages 41--50. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. Technical report, pages 1--6, 2009.Google ScholarGoogle Scholar
  12. A. B. Goldberg and X. Zhu. Seeing stars when there aren't many stars: Graph-based semi-supervised learning for sentiment categorization. In TextGraphs WorkShop, pages 45--52. Association for Computational Linguistics, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Q. Gu and J. Zhou. Co-clustering on manifolds. In SIGKDD Conference, pages 359--368. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. Hatzivassiloglou and K. R. McKeown. Predicting the semantic orientation of adjectives. In COLING/EACL Conference, pages 174--181, Morristown, NJ, USA, 1997. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. X. Hu, J. Tang, H. Gao, and H. Liu. Unsupervised sentiment analysis with emotional signals. In World Wide Web Conference. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Kim and H. Park. Fast nonnegative matrix factorization: An active-set-like method and comparisons. SIAM J. Sci. Comput., 33(6):3261--3281, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Kim, J. Yoo, H. Lim, H. Qiu, Z. Kozareva, and A. Galstyan. Sentiment prediction using collaborative filtering. In ICWSM'13, 2013.Google ScholarGoogle Scholar
  18. H. W. Kuhn and A. W. Tucker. Nonlinear programming. In Proceedings of the 2nd Berkeley Symposium on Mathematical Statistics and Probability, pages 481--492. University of California Press, Berkeley, CA, USA, 1950.Google ScholarGoogle Scholar
  19. D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In NIPS Conference, pages 556--562. MIT Press, 2000.Google ScholarGoogle Scholar
  20. C.-J. Lin. Projected gradient methods for nonnegative matrix factorization. Neural Comput., 19(10):2756--2779, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Lin and A. Kolcz. Large-scale machine learning at twitter. In SIGMOD Conference, pages 793--804. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Long, J. Wang, G. Ding, D. Shen, and Q. Yang. Transfer learning with graph co-regularization. In AAAI Conference, 2012.Google ScholarGoogle Scholar
  23. P. Melville, W. Gryc, and R. D. Lawrence. Sentiment analysis of blogs by combining lexical knowledge with text classification. In SIGKDD Conference, pages 1275--1284, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. L. T. Nguyen, P. Wu, W. Chan, W. Peng, and Y. Zhang. Predicting collective sentiment dynamics from time-series social media. In WSDM Workshop, pages 6:1--6:8. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up : sentiment classification using machine learning techniques. In EMNLP Conference, pages 79--86, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Saha and V. Sindhwani. Learning evolving and emerging topics in social media: a dynamic nmf approach with temporal regularization. In WSDM Conference, pages 693--702. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. L. Smith, L. Zhu, K. Lerman, and Z. Kozareva. The role of social media in the discussion of controversial topics. In SocialCom/PASSAT Conference, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Workshop on Unsupervised Learning in NLP, pages 53--63. Association for Computational Linguistics, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. Tan, L. Lee, J. Tang, L. Jiang, M. Zhou, and P. Li. User-level sentiment analysis incorporating social networks. In SIGKDD Conference, pages 1397--1405, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. F. Wang, P. Li, and A. C. Knig. Efficient document clustering via online nonnegative matrix factorizations. In SDM Conference, pages 908--919, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  31. X. Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang. Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In CIKM Conference, pages 1031--1040, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In HLT/EMNLP Conference, pages 347--354. Association for Computational Linguistics, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. A model-based approach to attributed graph clustering. In SIGMOD Conference, pages 505--516, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. Gbagc: A general bayesian framework for attributed graph clustering. To appear in TKDD, 2014.Google ScholarGoogle Scholar
  35. L. Zhu, A. Galstyan, J. Cheng, and K. Lerman. Tripartite graph clustering for dynamic sentiment analysis on social media. Technical Report (arXiv:1402.6010v2 {cs.SI}), 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. L. Zhu, S. Gao, S. J. Pan, H. Li, D. Deng, and C. Shahabi. Graph-based informative-sentence selection for opinion summarization. In ASONAM Conference, pages 408--412, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. F. Zhuang, P. Luo, H. Xiong, Q. He, Y. Xiong, and Z. Shi. Exploiting associations between word clusters and document classes for cross-domain text categorization. Stat. Anal. Data Min., 4(1):100--114, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Tripartite graph clustering for dynamic sentiment analysis on social media

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
      June 2014
      1645 pages
      ISBN:9781450323765
      DOI:10.1145/2588555

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 June 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGMOD '14 Paper Acceptance Rate107of421submissions,25%Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader