Skip to main content

Aspect Based Sentiment Analysis in Bangla Dataset Based on Aspect Term Extraction

  • Conference paper
  • First Online:
Cyber Security and Computer Science (ICONCS 2020)

Abstract

Recent years have seen rapid growth of research on sentiment analysis. In aspect-based sentiment analysis, the idea is to take sentiment analysis a step further and find out what exactly someone is talking about, and then measuring the sentiment if she or he likes or dislikes it. Sentiment analysis in Bengali language is progressing and is considered as an important research interest. Due to scarcity of resources like proper annotated dataset, corpora, lexicon such as part of speech tagger etc. aspect-based sentiment analysis hardly has been done in Bengali language. In this paper, we have conducted our experiments based on a recent work from 2018 using conventional supervised machine learning algorithms (RF, SVM, KNN) to perform one of the ABSA’s tasks - aspect category extraction. The work is done on two datasets named – Cricket and Restaurant. We then compared our results with the existing work. We used two traditional steps to clean data and found that less preprocessing leads to better F1 Score. For Cricket dataset, SVM and KNN performed better, resulting F1 score of 37% and 27%. For Restaurant dataset, RF and SVM achieved improved score of 35% and 39% respectively. Additionally, we selected two more algorithms LR and NB, LR achieved best F1 score (43%) for Restaurant dataset among all.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 159.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Data never sleeps 5.0. https://www.domo.com/learn/data-never-sleeps-5

  2. MonkeyLearn. https://monkeylearn.com/sentiment-analysis/#what-is-sentiment-analysis

  3. Wang, B., Liu, M.: Deep learning for aspect-based sentiment analysis. Report cs224, Stanford University (2015)

    Google Scholar 

  4. Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I.: SemEval-2015 task 12: aspect based sentiment analysis. In: 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 486–495. Association for Computational Linguistics, Denver (2015). https://doi.org/10.18653/v1/s15-2082

  5. Rahman, M.A., Dey, E.K.: Datasets for aspect-based sentiment analysis in Bangla dataset. MDPI J. 3(2), 15 (2018). https://doi.org/10.3390/data3020015

  6. Pontiki, M., Bakagianni, J.: SemEval-2014 ABSA Test Data (Gold Annotations Corpus). http://metashare.elda.org/repository/browse/semeval-2014-absa-test-data-gold-annotations/b98d11cec18211e38229842b2b6a04d77591d40acd7542b7af823a54fb03a155/

  7. Pontiki M., Galanis, D., Pavlopoulos, J., Papageorgiou H., Androutsopoulos I., Manandhar S.: SemEval-2014 task 4: aspect based sentiment analysis. In: 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 27–35. Association for Computational Linguistics (2014). https://doi.org/10.3115/v1/s14-2004

  8. Hercig, T., Brychc, T., Svoboda, L., Konko, M., Konko, M.: Unsupervised methods to improve aspect-based sentiment analysis in Czech. Comput. Sist. 20(3), 365–375 (2016). https://doi.org/10.13053/cys-20-3-2469

  9. Hasib, T., Rahin, S.A.: Apsect-based sentiment analysis using Semeval and Amazon datasets. Academic thesis Paper, BRAC University (2017)

    Google Scholar 

  10. Thet, T.T., Na, J.C., Khoo, C.S.G.: Aspect-based sentiment analysis of movie reviews on discussion. J. Inf. Sci. 36(6), 823–848 (2010). https://doi.org/10.1177/0165551510388123

    Article  Google Scholar 

  11. Poria, S., Cambria, E., Ku, L.W., Gui, C., Gelbukh, A.: A rule-based approach to aspect extraction from product reviews. In: 2nd Workshop on Natural Language Processing for Social Media (SocialNLP), pp. 28–37. Association for Computational Linguistics and Dublin City University, Ireland (2014) https://doi.org/10.3115/v1/w14-5905

  12. Smadi, M.A., Qawasmeh, O., Talafha, B., Quwaider, M.: Human annotated arabic dataset of book reviews for aspect-based sentiment analysis. In: 3rd International Conference on Future Internet of Things and Cloud, pp. 726–730. IEEE, Italy (2015). https://doi.org/10.1109/ficloud.2015.62

  13. Tamchyna, A., Fiala, O., Veselovská, K.: Czech aspect-based sentiment analysis: a new dataset and preliminary results. In: Information Technology Application Theory (ITAT 2015), vol. 1422, pp. 95–99. CEUR-WS, Slovakia (2015)

    Google Scholar 

  14. Apidianaki, M., Tannier, X., Richart, C.: Datasets for aspect-based sentiment analysis in French. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp. 1122–1126. European Language Resources Association (ELRA), Portorož (2016)

    Google Scholar 

  15. Akhtar, M.S., Ekbal, A., Bhattacharyya, P.: Aspect based sentiment analysis in Hindi: resource creation and evaluation. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp. 2703–2709. European Language Resources Association, Portorož (2016)

    Google Scholar 

  16. Sklearn. https://pypi.org/project/sklearn/

  17. Bengali Language. https://en.wikipedia.org/wiki/Bengali_language

  18. Gentle introduction to the bag-of-words model. https://machinelearningmastery.com/gentle-introduction-bag-words-model/

  19. Panchal, A.: Text Summarization using TF-IDF. Towards Datascience. https://towardsdatascience.com/text-summarization-using-tf-idf-e64a0644ace3

  20. Sklearn.feature_extraction.text.TfidfVectorizer. https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html

  21. Hamdan, H., Bellot, P., Bechet, F.: Lsislif: CRF and logistic regression for opinion target extraction and sentiment polarity analysis. In: 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 753–758. Association for Computational Linguistics, Denver (2015). https://doi.org/10.18653/v1/s15-2128

  22. Mubarok, M.S., Adiwijaya, Aldhi. M.D.: Aspect-based sentiment analysis to review products using Naive Bayes. In: AIP Conference, vol. 1867 (2017). https://doi.org/10.1063/1.4994463

  23. Chowdhury, S., Chowdhury, W.: Performing sentiment analysis in Bangla microblog posts. In: 2014 International Conference on Informatics, Electronics & Vision (ICIEV), pp. 1–6, IEEE, Dhaka (2014). https://doi.org/10.1109/iciev.2014.6850712

  24. Korkmaz, M., Güney, S., Yigiter, S.Y.: The importance of logistic regression implementations in the Turkish livestock sector and logistic regression implementations/fields, Turkey (2012)

    Google Scholar 

  25. Ismail, H., Harous, S., Belkhouche, B.: A comparative analysis of machine learning classifiers for Twitter sentiment analysis. Res. Comput. Sci. 110, 71–83 (2016). https://doi.org/10.13053/rcs-110-1-6

    Article  Google Scholar 

  26. Jurafsky, D.: Language modeling, index of class cs124/lecture. Stanford University (2018)

    Google Scholar 

  27. NLTK 3.4.4 documentation. https://www.nltk.org/. Accessed 22 May 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Sabrina Haque , Tasnim Rahman or Asif Khan Shakir .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Haque, S. et al. (2020). Aspect Based Sentiment Analysis in Bangla Dataset Based on Aspect Term Extraction. In: Bhuiyan, T., Rahman, M.M., Ali, M.A. (eds) Cyber Security and Computer Science. ICONCS 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 325. Springer, Cham. https://doi.org/10.1007/978-3-030-52856-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-52856-0_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-52855-3

  • Online ISBN: 978-3-030-52856-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics