ABSTRACT
Mining detailed opinions buried in the vast amount of review text data is an important, yet quite challenging task with widespread applications in multiple domains. Latent Aspect Rating Analysis (LARA) refers to the task of inferring both opinion ratings on topical aspects (e.g., location, service of a hotel) and the relative weights reviewers have placed on each aspect based on review content and the associated overall ratings. A major limitation of previous work on LARA is the assumption of pre-specified aspects by keywords. However, the aspect information is not always available, and it may be difficult to pre-define appropriate aspects without a good knowledge about what aspects are actually commented on in the reviews.
In this paper, we propose a unified generative model for LARA, which does not need pre-specified aspect keywords and simultaneously mines 1) latent topical aspects, 2) ratings on each identified aspect, and 3) weights placed on different aspects by a reviewer. Experiment results on two different review data sets demonstrate that the proposed model can effectively perform the Latent Aspect Rating Analysis task without the supervision of aspect keywords. Because of its generality, the proposed model can be applied to explore all kinds of opinionated text data containing overall sentiment judgments and support a wide range of interesting application tasks, such as aspect-based opinion summarization, personalized entity ranking and recommendation, and reviewer behavior analysis.
- Onix text retrieval toolkit stopword list. http://www.lextek.com/manuals/onix/stopwords1.html.Google Scholar
- D. Blei and J. McAuliffe. Supervised topic models. Advances in Neural Information Processing Systems, 20:121--128, 2008.Google Scholar
- D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. The Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarDigital Library
- X. Ding, B. Liu, and L. Zhang. Entity discovery and assignment for opinion mining applications. In Proceedings of the 15th KDD, pages 1125--1134. ACM, 2009. Google ScholarDigital Library
- J. Graça, K. Ganchev, and B. Taskar. Expectation maximization and posterior constraints. In Advances in Neural Information Processing Systems, volume 20. MIT Press, 2007.Google Scholar
- M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the 10th KDD, pages 168--177. ACM, 2004. Google ScholarDigital Library
- M. Hu and B. Liu. Mining opinion features in customer reviews. In AAAI, pages 755--760. AAAI Press / The MIT Press, 2004. Google ScholarDigital Library
- N. Jindal and B. Liu. Identifying comparative sentences in text documents. In Proceedings of 29th SIGIR, 2006. Google ScholarDigital Library
- Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining, 2011. Google ScholarDigital Library
- M. Jordan, Z. Ghahramani, T. Jaakkola, and L. Saul. An introduction to variational methods for graphical models. Machine learning, 37(2):183--233, 1999. Google ScholarDigital Library
- S. Kim and E. Hovy. Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics, pages 1367--1373. Association for Computational Linguistics, 2004. Google ScholarDigital Library
- C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In Proceeding of the 18th CIKM, pages 375--384, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
- L. Lovász and M. Plummer. Matching theory. Elsevier Science Ltd, 1986.Google Scholar
- Y. Lu and C. Zhai. Opinion integration through semi-supervised topic modeling. In Proceeding of the 17th WWW, pages 121--130. ACM, 2008. Google ScholarDigital Library
- Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In Proceedings of the 18th WWW, pages 131--140, 2009. Google ScholarDigital Library
- Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th WWW, pages 171--180. ACM, 2007. Google ScholarDigital Library
- S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima. Mining product reputations on the web. In Proceeding of the 8th KDD, pages 341--349, 2002. Google ScholarDigital Library
- B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd ACL, pages 115--124, 2005. Google ScholarDigital Library
- B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, pages 79--86, 2002. Google ScholarDigital Library
- A.-M. Popescu and O. Etzioni. Extracting product features and opinions from reviews. In HLT '05, pages 339--346, Morristown, NJ, USA, 2005. Google ScholarDigital Library
- I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of the 46th ACL, pages 308--316, 2008.Google Scholar
- I. Titov and R. T. McDonald. Modeling online reviews with multi-grain topic models. In Proceeding of the 17th WWW, pages 111--120, 2008. Google ScholarDigital Library
- H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th KDD, pages 783--792. ACM, 2010. Google ScholarDigital Library
- W. X. Zhao, J. Jiang, H. Yan, and X. Li. Jointly modeling aspects and opinions with a maxent-lda hybrid. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP '10, 2010. Google ScholarDigital Library
- L. Zhuang, F. Jing, and X.-Y. Zhu. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, 2006. Google ScholarDigital Library
Index Terms
- Latent aspect rating analysis without aspect keyword supervision
Recommendations
Latent aspect rating analysis on review text data: a rating regression approach
KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data miningIn this paper, we define and study a new opinionated text data analysis problem called Latent Aspect Rating Analysis (LARA), which aims at analyzing opinions expressed about an entity in an online review at the level of topical aspects to discover each ...
Aspect and sentiment unification model for online review analysis
WSDM '11: Proceedings of the fourth ACM international conference on Web search and data miningUser-generated reviews on the Web contain sentiments about detailed aspects of products and services. However, most of the reviews are plain text and thus require much effort to obtain information about relevant details. In this paper, we tackle the ...
Aspect-based opinion mining from product reviews
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval"What other people think" has always been an important piece of information for most of us during the decision-making process. Today people tend to make their opinions available to other people via the Internet. As a result, the Web has become an ...
Comments