research-article

Tripartite graph clustering for dynamic sentiment analysis on social media

Authors:
Linhong Zhu, University of Southern California, Los Angeles, CA, USA
Aram Galstyan, University of Southern California, Los Angeles, CA, USA
James Cheng, The Chinese University of Hong Kong, Hong Kong, China
Kristina Lerman, University of Southern California, Los Angeles, CA, USA

SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, June 2014, Pages 1531–1542. https://doi.org/10.1145/2588555.2593682

Published: 18 June 2014


ABSTRACT

The growing popularity of social media (e.g., Twitter) allows users to easily share information with each other and influence others by expressing their own sentiments on various subjects. In this work, we propose an unsupervised tri-clustering framework, which analyzes both user-level and tweet-level sentiments through co-clustering of a tripartite graph. A compelling feature of the proposed framework is that the quality of sentiment clustering of tweets, users, and features can be mutually improved by joint clustering. We further investigate the evolution of user-level sentiments and latent feature vectors in an online framework and devise an efficient online algorithm to sequentially update the clustering of tweets, users and features with newly arrived data. The online framework not only provides better quality of both dynamic user-level and tweet-level sentiment analysis, but also improves the computational and storage efficiency. We verified the effectiveness and efficiency of the proposed approaches on the November 2012 California ballot Twitter data.
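To give a rough sense of what jointly clustering tweets, users, and features can look like, the sketch below factorizes three nonnegative relation matrices with shared cluster-indicator factors, using Lee-Seung-style multiplicative updates. This is not the paper's algorithm: the abstract states no objective or update rules, so the squared-error objective, the updates, the toy data, and every name here (`tri_cluster`, `Xp`, `Xu`, `Xr`, `Sp`, `Su`, `Sf`) are assumptions made purely for illustration.

```python
import numpy as np

def tri_cluster(Xp, Xu, Xr, k=2, iters=200, seed=0, eps=1e-9):
    """Jointly factorize three nonnegative relation matrices (illustrative only).

    Xp: tweets x features, Xu: users x features, Xr: users x tweets.
    Assumed model: Xp ~ Sp Sf^T, Xu ~ Su Sf^T, Xr ~ Su Sp^T, all factors >= 0.
    Returns the factors (Sp, Su, Sf) and the loss after each iteration.
    """
    rng = np.random.default_rng(seed)
    n, l = Xp.shape
    m = Xu.shape[0]
    # Strictly positive random initialization.
    Sp = rng.random((n, k)) + 0.1
    Su = rng.random((m, k)) + 0.1
    Sf = rng.random((l, k)) + 0.1

    def loss():
        # Sum of squared Frobenius reconstruction errors over the three relations.
        return (np.linalg.norm(Xp - Sp @ Sf.T) ** 2
                + np.linalg.norm(Xu - Su @ Sf.T) ** 2
                + np.linalg.norm(Xr - Su @ Sp.T) ** 2)

    losses = [loss()]
    for _ in range(iters):
        # Each factor is rescaled elementwise; every update sees both
        # relations that the factor takes part in.
        Sp *= (Xp @ Sf + Xr.T @ Su) / (Sp @ (Sf.T @ Sf + Su.T @ Su) + eps)
        Su *= (Xu @ Sf + Xr @ Sp) / (Su @ (Sf.T @ Sf + Sp.T @ Sp) + eps)
        Sf *= (Xp.T @ Sp + Xu.T @ Su) / (Sf @ (Sp.T @ Sp + Su.T @ Su) + eps)
        losses.append(loss())
    return Sp, Su, Sf, losses

# Hypothetical toy data: 4 tweets, 4 features, 2 users.
Xp = np.array([[1, 1, 0, 0],
               [1, 1, 0, 0],
               [0, 0, 1, 1],
               [0, 0, 1, 1]], dtype=float)
Xr = np.array([[1, 1, 0, 0],
               [0, 0, 1, 1]], dtype=float)
Xu = Xr @ Xp  # user-feature counts aggregated from each user's tweets

Sp, Su, Sf, losses = tri_cluster(Xp, Xu, Xr)
tweet_clusters = Sp.argmax(axis=1)  # hard assignment: largest factor entry per row
user_clusters = Su.argmax(axis=1)
```

Because the tweet factor `Sp` appears in both the tweet-feature and the user-tweet terms, evidence from users constrains the tweet clusters and vice versa, which is the intuition behind the abstract's claim that joint clustering lets the three clusterings improve one another. An online variant would additionally need a rule for carrying factors across time windows, which this sketch does not attempt.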


Index Terms

1. Information systems
  1. Information systems applications
    1. Data mining


Published in
SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
June 2014
1645 pages
ISBN: 9781450323765
DOI: 10.1145/2588555
General Chairs: Curtis Dyreson (Utah State University, USA), Feifei Li (University of Utah, USA)
Program Chair: M. Tamer Özsu (University of Waterloo, Canada)
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
matrix factorization
sentiment analysis
tripartite graph clustering
Qualifiers
- research-article
Acceptance Rates
SIGMOD '14 Paper Acceptance Rate: 107 of 421 submissions, 25%
Overall Acceptance Rate: 785 of 4,003 submissions, 20%