Social Media based Hate Speech Detection using Machine Learning

Authors

  • Dr. Nisha Auti  HOD, Associate Professor, Department of Computer Engineering, JSPM Narhe Technical Campus, Pune, Maharashtra, India
  • Shreeraj Ghadge  Department of Computer Engineering, JSPM Narhe Technical Campus, Pune, Maharashtra, India
  • Rajdatta Jadhav  Department of Computer Engineering, JSPM Narhe Technical Campus, Pune, Maharashtra, India
  • Prajwal Jagtap  Department of Computer Engineering, JSPM Narhe Technical Campus, Pune, Maharashtra, India
  • Sumit Ranaware  Department of Computer Engineering, JSPM Narhe Technical Campus, Pune, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT228653

Keywords:

Hate Speech, Machine Learning, Social Media,Social Network, Multi-Class Hate Speech, Natural Language Processing, Hate Speech Classification, Social Media Microblogs, Multi-Class Hate Speech Dataset, Twitter Hate Speech, Text Mining, Features Exploration

Abstract

Hate speech is a crime that has been increasing in recent years, not only in person but also online. There are several causes for this. There is tremendous growth in social media that promotes full freedom of expression through anonymity features. Freedom of expression is a human right, but hate speech directed at individuals or groups on the basis of race, caste, religion, ethnicity or nationality, gender, disability, gender identity, etc. is a violation of that sovereignty. Freedom of expression is a human right, but hate speech directed at individuals or groups on the basis of race, caste, religion, ethnicity or nationality, gender, disability, gender identity, etc. is a violation of that sovereignty. It promotes violence and hate crimes, creates social imbalances, and undermines peace, trust and human rights. Revealing hate speech in social media discourse is a very important but complex task. On the one hand, the anonymity provided by the Internet, especially social networks, makes people more likely to engage in hostile behavior. On the other hand, the desire to express one's thoughts on the Internet has increased, leading to the spread of hate speech. Governments and social media platforms can benefit from detection and prevention technologies, as this kind of bigoted language can wreak havoc on society. We help resolve this dilemma by providing a systematic overview of research on this topic in this survey. This project aims to accurately predict various forms by addressing different categories of hate individually and examining a set of text mining functions. Hate speech detection

References

  1. Muhammad Sabih “Un-Compromised Credibility: Social Media Based Multi-Class Hate Speech Classification for Text” January 2021IEEE Access 9:109465-109477 DOI:10.1109/ACCESS.2021.3101977 License CC BY-NC-ND 4.0
  2. Dris, David, Ogunseye, Elizabeth Oluyemisi and Akinola, Solomon Olalekan. (2020). “Detecting Hate Speech on social media Using Deep Learning Techniques”, University of Ibadan Journal of Science and Logics in ICT Research (UIJSLICTR), Vol. 5 No. 1, pp. 22 - 38. ©U IJSLICTR Vol. 5, No. 1, June 2020. [3] Filip Klubicka, Raquel Fernandez “Examining a hate speech corpus for hate speech detection and popularity prediction” arXiv:1805.04661v1 [cs.CL] 12 May 2018. [4] Irene Kwok and Yuzhou Wang “Locate the Hate: Detecting Tweets against Blacks”
  3. Thomas Davidson, Dana Warmsley, Michael Macy, Ingmar Weber “Automated Hate Speech Detection and the Problem of Offensive Language” arXiv:1805.04661v1 [cs.CL] 12 May 2017
  4. Mohiyaddeen, Dr. Shifaulla Siddiqui “Automatic Hate Speech Detection: A Literature Review” e-ISSN: 2250-0758 | p-ISSN: 2394-6962 Volume-11, Issue-2 (April 2021)
  5. Resmi Reghunathan, Asha A S “Hate Speech Detection in Conventional Language on Social Media by using Machine Learning” International Journal of Engineering Research & Technology) http://www.ijert.org ISSN: 2278-0181 Vol. 11 Issue 06, June-2022
  6. Areej Al-Hass an, Hmood Al-Dossari “Detection of hate speech in social networks: a survey on multilingual corpus” Conference Paper · February 2019 DOI: 10.5121/csit.2019.90208
  7. Sindhu Abro , Sarang Shaikh , Zafar Ali Sajid Khan , Ghulam Mujtaba “Automatic Hate Speech Detection Using Machine Learning: A Comparative Study” (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 11, No. 8, 2020.
  8. Pete Burnap and Matthew L. Williams “Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modelling for Policy and Decision Making .” 1944-2866 # 2015 The Authors. Policy & Internet published by Wiley Periodicals, Inc. on behalf of Policy Studies Organization.
  9. Sreelakshmi ka , Premjith Ba , Soman K.Pa “Detection of Hate Speech Text in Hindi English Code-mixed Data” Procedia Computer Science 171 (2020) 737–744 .
  10. Mathew, Binny, et al. "Analyzing the hate and counter speech accounts on twitter." arXiv preprint arXiv:1812.02712 (2018).
  11. Gaydhani, Aditya, et al. "Detecting hate speech and offensive language on twitter using machine learning: An n-gram and tfidf based approach." arXiv preprint arXiv: 1809.08651 (2018).
  12. Watanabe, Hajime, Mondher Bouazizi, and Tomoaki Ohtsuki. "Hate speech on twitter: A pragmatic approach to collect hateful and offensive expressions and perform hate speech detection." IEEE access 6 (2018): 13825-13835.
  13. Wich, Maximilian, Jan Bauer, and Georg Groh. "Impact of politically biased data on hate speech classification." Proceedings of the Fourth Workshop on Online Abuse and Harms. 2020.

Downloads

Published

2022-12-30

Issue

Section

Research Articles

How to Cite

[1]
Dr. Nisha Auti, Shreeraj Ghadge, Rajdatta Jadhav, Prajwal Jagtap, Sumit Ranaware, " Social Media based Hate Speech Detection using Machine Learning, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 6, pp.443-450, November-December-2022. Available at doi : https://doi.org/10.32628/CSEIT228653