skip to main content
10.1145/2020408.2020505acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Latent aspect rating analysis without aspect keyword supervision

Authors Info & Claims
Published:21 August 2011Publication History

ABSTRACT

Mining detailed opinions buried in the vast amount of review text data is an important, yet quite challenging task with widespread applications in multiple domains. Latent Aspect Rating Analysis (LARA) refers to the task of inferring both opinion ratings on topical aspects (e.g., location, service of a hotel) and the relative weights reviewers have placed on each aspect based on review content and the associated overall ratings. A major limitation of previous work on LARA is the assumption of pre-specified aspects by keywords. However, the aspect information is not always available, and it may be difficult to pre-define appropriate aspects without a good knowledge about what aspects are actually commented on in the reviews.

In this paper, we propose a unified generative model for LARA, which does not need pre-specified aspect keywords and simultaneously mines 1) latent topical aspects, 2) ratings on each identified aspect, and 3) weights placed on different aspects by a reviewer. Experiment results on two different review data sets demonstrate that the proposed model can effectively perform the Latent Aspect Rating Analysis task without the supervision of aspect keywords. Because of its generality, the proposed model can be applied to explore all kinds of opinionated text data containing overall sentiment judgments and support a wide range of interesting application tasks, such as aspect-based opinion summarization, personalized entity ranking and recommendation, and reviewer behavior analysis.

References

  1. Onix text retrieval toolkit stopword list. http://www.lextek.com/manuals/onix/stopwords1.html.Google ScholarGoogle Scholar
  2. D. Blei and J. McAuliffe. Supervised topic models. Advances in Neural Information Processing Systems, 20:121--128, 2008.Google ScholarGoogle Scholar
  3. D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. The Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. X. Ding, B. Liu, and L. Zhang. Entity discovery and assignment for opinion mining applications. In Proceedings of the 15th KDD, pages 1125--1134. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Graça, K. Ganchev, and B. Taskar. Expectation maximization and posterior constraints. In Advances in Neural Information Processing Systems, volume 20. MIT Press, 2007.Google ScholarGoogle Scholar
  6. M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the 10th KDD, pages 168--177. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Hu and B. Liu. Mining opinion features in customer reviews. In AAAI, pages 755--760. AAAI Press / The MIT Press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Jindal and B. Liu. Identifying comparative sentences in text documents. In Proceedings of 29th SIGIR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Jordan, Z. Ghahramani, T. Jaakkola, and L. Saul. An introduction to variational methods for graphical models. Machine learning, 37(2):183--233, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Kim and E. Hovy. Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics, pages 1367--1373. Association for Computational Linguistics, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In Proceeding of the 18th CIKM, pages 375--384, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Lovász and M. Plummer. Matching theory. Elsevier Science Ltd, 1986.Google ScholarGoogle Scholar
  14. Y. Lu and C. Zhai. Opinion integration through semi-supervised topic modeling. In Proceeding of the 17th WWW, pages 121--130. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In Proceedings of the 18th WWW, pages 131--140, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th WWW, pages 171--180. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima. Mining product reputations on the web. In Proceeding of the 8th KDD, pages 341--349, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd ACL, pages 115--124, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, pages 79--86, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A.-M. Popescu and O. Etzioni. Extracting product features and opinions from reviews. In HLT '05, pages 339--346, Morristown, NJ, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of the 46th ACL, pages 308--316, 2008.Google ScholarGoogle Scholar
  22. I. Titov and R. T. McDonald. Modeling online reviews with multi-grain topic models. In Proceeding of the 17th WWW, pages 111--120, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th KDD, pages 783--792. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. W. X. Zhao, J. Jiang, H. Yan, and X. Li. Jointly modeling aspects and opinions with a maxent-lda hybrid. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP '10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. L. Zhuang, F. Jing, and X.-Y. Zhu. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Latent aspect rating analysis without aspect keyword supervision

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2011
      1446 pages
      ISBN:9781450308137
      DOI:10.1145/2020408

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 August 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader