Abstract
Recent years have seen rapid growth of research on sentiment analysis. In aspect-based sentiment analysis, the idea is to take sentiment analysis a step further and find out what exactly someone is talking about, and then measuring the sentiment if she or he likes or dislikes it. Sentiment analysis in Bengali language is progressing and is considered as an important research interest. Due to scarcity of resources like proper annotated dataset, corpora, lexicon such as part of speech tagger etc. aspect-based sentiment analysis hardly has been done in Bengali language. In this paper, we have conducted our experiments based on a recent work from 2018 using conventional supervised machine learning algorithms (RF, SVM, KNN) to perform one of the ABSA’s tasks - aspect category extraction. The work is done on two datasets named – Cricket and Restaurant. We then compared our results with the existing work. We used two traditional steps to clean data and found that less preprocessing leads to better F1 Score. For Cricket dataset, SVM and KNN performed better, resulting F1 score of 37% and 27%. For Restaurant dataset, RF and SVM achieved improved score of 35% and 39% respectively. Additionally, we selected two more algorithms LR and NB, LR achieved best F1 score (43%) for Restaurant dataset among all.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Data never sleeps 5.0. https://www.domo.com/learn/data-never-sleeps-5
MonkeyLearn. https://monkeylearn.com/sentiment-analysis/#what-is-sentiment-analysis
Wang, B., Liu, M.: Deep learning for aspect-based sentiment analysis. Report cs224, Stanford University (2015)
Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I.: SemEval-2015 task 12: aspect based sentiment analysis. In: 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 486–495. Association for Computational Linguistics, Denver (2015). https://doi.org/10.18653/v1/s15-2082
Rahman, M.A., Dey, E.K.: Datasets for aspect-based sentiment analysis in Bangla dataset. MDPI J. 3(2), 15 (2018). https://doi.org/10.3390/data3020015
Pontiki, M., Bakagianni, J.: SemEval-2014 ABSA Test Data (Gold Annotations Corpus). http://metashare.elda.org/repository/browse/semeval-2014-absa-test-data-gold-annotations/b98d11cec18211e38229842b2b6a04d77591d40acd7542b7af823a54fb03a155/
Pontiki M., Galanis, D., Pavlopoulos, J., Papageorgiou H., Androutsopoulos I., Manandhar S.: SemEval-2014 task 4: aspect based sentiment analysis. In: 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 27–35. Association for Computational Linguistics (2014). https://doi.org/10.3115/v1/s14-2004
Hercig, T., Brychc, T., Svoboda, L., Konko, M., Konko, M.: Unsupervised methods to improve aspect-based sentiment analysis in Czech. Comput. Sist. 20(3), 365–375 (2016). https://doi.org/10.13053/cys-20-3-2469
Hasib, T., Rahin, S.A.: Apsect-based sentiment analysis using Semeval and Amazon datasets. Academic thesis Paper, BRAC University (2017)
Thet, T.T., Na, J.C., Khoo, C.S.G.: Aspect-based sentiment analysis of movie reviews on discussion. J. Inf. Sci. 36(6), 823–848 (2010). https://doi.org/10.1177/0165551510388123
Poria, S., Cambria, E., Ku, L.W., Gui, C., Gelbukh, A.: A rule-based approach to aspect extraction from product reviews. In: 2nd Workshop on Natural Language Processing for Social Media (SocialNLP), pp. 28–37. Association for Computational Linguistics and Dublin City University, Ireland (2014) https://doi.org/10.3115/v1/w14-5905
Smadi, M.A., Qawasmeh, O., Talafha, B., Quwaider, M.: Human annotated arabic dataset of book reviews for aspect-based sentiment analysis. In: 3rd International Conference on Future Internet of Things and Cloud, pp. 726–730. IEEE, Italy (2015). https://doi.org/10.1109/ficloud.2015.62
Tamchyna, A., Fiala, O., Veselovská, K.: Czech aspect-based sentiment analysis: a new dataset and preliminary results. In: Information Technology Application Theory (ITAT 2015), vol. 1422, pp. 95–99. CEUR-WS, Slovakia (2015)
Apidianaki, M., Tannier, X., Richart, C.: Datasets for aspect-based sentiment analysis in French. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp. 1122–1126. European Language Resources Association (ELRA), Portorož (2016)
Akhtar, M.S., Ekbal, A., Bhattacharyya, P.: Aspect based sentiment analysis in Hindi: resource creation and evaluation. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp. 2703–2709. European Language Resources Association, Portorož (2016)
Sklearn. https://pypi.org/project/sklearn/
Bengali Language. https://en.wikipedia.org/wiki/Bengali_language
Gentle introduction to the bag-of-words model. https://machinelearningmastery.com/gentle-introduction-bag-words-model/
Panchal, A.: Text Summarization using TF-IDF. Towards Datascience. https://towardsdatascience.com/text-summarization-using-tf-idf-e64a0644ace3
Sklearn.feature_extraction.text.TfidfVectorizer. https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html
Hamdan, H., Bellot, P., Bechet, F.: Lsislif: CRF and logistic regression for opinion target extraction and sentiment polarity analysis. In: 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 753–758. Association for Computational Linguistics, Denver (2015). https://doi.org/10.18653/v1/s15-2128
Mubarok, M.S., Adiwijaya, Aldhi. M.D.: Aspect-based sentiment analysis to review products using Naive Bayes. In: AIP Conference, vol. 1867 (2017). https://doi.org/10.1063/1.4994463
Chowdhury, S., Chowdhury, W.: Performing sentiment analysis in Bangla microblog posts. In: 2014 International Conference on Informatics, Electronics & Vision (ICIEV), pp. 1–6, IEEE, Dhaka (2014). https://doi.org/10.1109/iciev.2014.6850712
Korkmaz, M., Güney, S., Yigiter, S.Y.: The importance of logistic regression implementations in the Turkish livestock sector and logistic regression implementations/fields, Turkey (2012)
Ismail, H., Harous, S., Belkhouche, B.: A comparative analysis of machine learning classifiers for Twitter sentiment analysis. Res. Comput. Sci. 110, 71–83 (2016). https://doi.org/10.13053/rcs-110-1-6
Jurafsky, D.: Language modeling, index of class cs124/lecture. Stanford University (2018)
NLTK 3.4.4 documentation. https://www.nltk.org/. Accessed 22 May 2019
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Haque, S. et al. (2020). Aspect Based Sentiment Analysis in Bangla Dataset Based on Aspect Term Extraction. In: Bhuiyan, T., Rahman, M.M., Ali, M.A. (eds) Cyber Security and Computer Science. ICONCS 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 325. Springer, Cham. https://doi.org/10.1007/978-3-030-52856-0_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-52856-0_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-52855-3
Online ISBN: 978-3-030-52856-0
eBook Packages: Computer ScienceComputer Science (R0)