skip to main content
10.1145/3417113.3423377acmconferencesArticle/Chapter ViewAbstractPublication PagesaseConference Proceedingsconference-collections
research-article

Roman Urdu reviews dataset for aspect based opinion mining

Published:22 January 2021Publication History

ABSTRACT

Social media, today, demonstrates the rapid growth of modern society as it becomes the main platform for Internet users to communicate and express themselves. People around the world, use a number of devices and resources to access the Internet, set up social networks, conduct online business, e-commerce, e-surveys, etc. Currently, social media is not only a technology that provides information to consumers, it also encourages users to connect and share their views and perspectives. It leads to an increase in inspiration towards Opinion Mining (OM), which is important for both customers and companies in making decisions. Individuals like to see the opinions provided by other customers about a particular product or a service. Companies need to analyze their customer's feedback to strengthen their business decisions. A lot of research has been performed in various languages in the field of Aspect Based OM (ABOM). However, there are still certain languages that need to be explored, such as Roman Urdu (RU). This paper presents a proposed reviews data-set (a RU data-set) of mobile reviews that has been manually annotated with multi-aspect sentiment labels at the sentence-level. It presents base-line results using different Machine Learning (ML) algorithms. The results demonstrate 71% F1-score for aspect detection and 64% for aspect-based polarity.

References

  1. Muhammad Umair Arshad, Muhammad Farrukh Bashir, Adil Majeed, Waseem Shahzad, and Mirza Omer Beg. 2019. Corpus for Emotion Detection on Roman Urdu. In 2019 22nd International Multitopic Conference (INMIC). IEEE, 1--6.Google ScholarGoogle Scholar
  2. Mubashar Nazar Awan and Mirza O Beg. 2020. TOP-Rank: A TopicalPostionRank for Extraction and Classification of Keyphrases in Text. Computer Speech & Language (2020), 101116.Google ScholarGoogle Scholar
  3. Muhammad Bilal, Huma Israr, Muhammad Shahid, and Amin Khan. 2016. Sentiment classification of Roman-Urdu opinions using Naïve Bayesian, Decision Tree and KNN classification techniques. Journal of King Saud University-Computer and Information Sciences 28, 3 (2016), 330--344.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Shivangi Chawla and Monica Mehrotra. 2018. An Ensemble-Classifier Based Approach for Multiclass Emotion Classification of Short Text. In 2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions)(ICRITO). IEEE, 768--774.Google ScholarGoogle Scholar
  5. Misbah Daud, Rafiullah Khan, Aitazaz Daud, et al. 2015. Roman Urdu opinion mining system (RUOMiS). arXiv preprint arXiv:1501.01386 (2015).Google ScholarGoogle Scholar
  6. Kushal Dave, Steve Lawrence, and David M Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th international conference on World Wide Web. 519--528.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cut Fiarni, Herastia Maharani, and Rino Pratama. 2016. Sentiment analysis system for Indonesia online retail shop review using hierarchy Naive Bayes technique. In 2016 4th international conference on information and communication technology (ICoICT). IEEE, 1--6.Google ScholarGoogle ScholarCross RefCross Ref
  8. Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. 168--177.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Salud María Jiménez-Zafra, Giacomo Berardi, Andrea Esuli, Diego Marcheggiani, M Teresa Martín-Valdivia, and Alejandro Moreo Fernández. 2015. A multi-lingual annotated dataset for aspect-oriented opinion mining. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 2533--2538.Google ScholarGoogle ScholarCross RefCross Ref
  10. Soo-Min Kim and Eduard Hovy. 2006. Extracting opinions, opinion holders, and topics expressed in online news media text. In Proceedings of the Workshop on Sentiment and Subjectivity in Text. 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Qian Liu, Zhiqiang Gao, Bing Liu, and Yuanlin Zhang. 2015. Automated rule selection for aspect extraction in opinion mining. In Twenty-Fourth International Joint Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  12. Neelam Mukhtar and Mohammad Abid Khan. 2019. Effective lexicon-based approach for Urdu sentiment analysis. Artificial Intelligence Review (2019), 1--28.Google ScholarGoogle Scholar
  13. Bilal Naeem, Aymen Khan, Mirza Omer Beg, and Hasan Mujtaba. 2020. A deep learning framework for clickbait detection on social area network using natural language cues. Journal of Computational Social Science (2020), 1--13.Google ScholarGoogle Scholar
  14. Samuel Pecar. 2018. Towards opinion summarization of customer reviews. In Proceedings of ACL 2018, Student Research Workshop. 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  15. Ana-Maria Popescu and Orena Etzioni. 2007. Extracting product features and opinions from reviews. In Natural language processing and text mining. Springer, 9--28.Google ScholarGoogle Scholar
  16. Andi Suciati and Indra Budi. 2019. Aspect-based Opinion Mining for Code-Mixed Restaurant Reviews in Indonesia. In 2019 International Conference on Asian Language Processing(IALP). IEEE, 59--64.Google ScholarGoogle Scholar

Index Terms

  1. Roman Urdu reviews dataset for aspect based opinion mining

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ASE '20: Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering
      September 2020
      195 pages
      ISBN:9781450381284
      DOI:10.1145/3417113

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 January 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate82of337submissions,24%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader