skip to main content
10.1145/1718487.1718502acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Adapting information bottleneck method for automatic construction of domain-oriented sentiment lexicon

Authors Info & Claims
Published:04 February 2010Publication History

ABSTRACT

Domain-oriented sentiment lexicons are widely used for fine-grained sentiment analysis on reviews; therefore, the automatic construction of domain-oriented sentiment lexicon is a fundamental and important task for sentiment analysis research. Most of existing construction approaches take only the kind of relationships between words into account, which makes them have a lot of room for improvement. This paper proposes an adapted information bottleneck method for the construction of domain-oriented sentiment lexicon. This approach can naturally make full use of the mutual reinforcement between documents and words by fusing three kinds of relationships either from words to documents or from words to words; either homogeneous or heterogeneous; either within-domain or cross-domain. The experimental results demonstrate that proposed method could dramatically improve the accuracy of the baseline approach on the construction of out-of-domain sentiment lexicon.

References

  1. A. Andreevskaia and S. Bergler. 2008. When Specialists and Generalists Work Together: Overcoming Domain Dependence in Sentiment Tagging. In Proceedings of ACL-08: HLT.Google ScholarGoogle Scholar
  2. A. Aue and M. Gamon. 2005. Customizing Sentiment Classifiers to New Domains: a Case Study. In Proceedings of RANLP.Google ScholarGoogle Scholar
  3. J. Cohen. 1960. A coefficient of agreement for nominal scales. In: Educational and Psychological measurements 20, pp. 37--46.Google ScholarGoogle Scholar
  4. T. Cover and J. Thomas. 1991. Elements of Information Theory. John Wiley & Sons, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Esuli and F. Sebastiani. 2005. Determining the semantic orientation of terms throush gloss classification. In Proceedings of CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Esuli and F. Sebastiani. 2006. SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining. In Proceedings of LREC.Google ScholarGoogle Scholar
  7. M. Gamon and A. Aue. 2005. Automatic identification of sentiment vocabulary exploiting low association with known sentiment terms. In Proceedings of ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. V. Hatzivassiloglous and K. McKeown. 1997. Predicting the semantic orientation of adjectives. In Proceedings of ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Hu and B. Liu. 2004. Mining and summarizing customer reviews. In Proceedings of KDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Kamps, M. Marx, R. Mokken, and M. Rijke. 2004. Using WordNet to measure semantic orientation of adjectives. In Proceedings of LREC.Google ScholarGoogle Scholar
  11. H. Kanayama, T. Nasukawa. 2006. Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis. In Proceedings of EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Kim and E. Hovy. 2004. Determining the sentiment of opinions. In Proceedings of COLING. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B. Pang, L. Lee and S. Vaithyanathan. 2002. Thumbs up? Sentiment Classification using Machine Learning Techniques. In Proceedings of EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Popescu and O. Etzioni. 2005. Extracting product features and opinions from reviews. In Proceedings of HLT/EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. N. Slonim, N. Tishby. 1999. Agglomerative information bottleneck. In Proceedings of NIPS.Google ScholarGoogle Scholar
  16. H. Takamura, T. Inui, M. Okumura. 2005. Extracting Semantic Orientations of Words using Spin Model. In Proceedings of ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Turney. 2002. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In Proceedings of ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. Turney and M. Littman. 2003. Measuring Praise and Criticism: Inference of Semantic Orientation from Association. In: ACM Transactions on Information Systems, 21(4): 315--346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Wiebe, T. Wilson and M. Bell. 2001. Identifying Collocations for Recognizing Opinions. In Proceedings of the ACL/EACL Workshop on Collocation.Google ScholarGoogle Scholar
  20. H. Yu and V. Hatzivassiloglou. 2003. Towards Answering opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences. In Proceedings of EMNLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. V. Stoyanov and C. Cardie. 2008. Topic Identification for Fine-Grained Opinion Analysis. In Proceedings of Coling. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Tang, S. Tan and X. Cheng. 2009. A Survey on Sentiment Detection of Reviews. Expert Systems with Applications. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Tan, G. Wu, H. Tang and X. Cheng. 2007. A novel scheme for domain-transfer problem in the context of sentiment analysis. In Proceedings of CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Tan, X. Cheng, Y. Wang, H. Xu. 2009. Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis. In Proceedings of ECIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Q. Wu, S. Tan, H. Zhai, G. Zhang, M. Duan and X. Cheng. 2009. SentiRank: Cross-Domain Graph Ranking for Sentiment Classification. In Proceedings of WI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. W. Du, S. Tan. 2009. Building Domain-oriented Sentiment Lexicon by Improved Information Bottleneck. In Proceedings of CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Adapting information bottleneck method for automatic construction of domain-oriented sentiment lexicon

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WSDM '10: Proceedings of the third ACM international conference on Web search and data mining
        February 2010
        468 pages
        ISBN:9781605588896
        DOI:10.1145/1718487

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 February 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate498of2,863submissions,17%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader