Non-linguistic Features for Cyberbullying Detection on a Social Media Platform Using Machine Learning

  • Conference paper
Cyberspace Safety and Security (CSS 2019)

Part of the book series: Lecture Notes in Computer Science (LNSC, volume 11982)

Abstract

Cyberbullying on social media platforms is a severe problem with serious negative consequences. Consequently, a number of studies on the automatic detection of cyberbullying using machine learning techniques have been conducted in recent years. While cyberbullying detection has traditionally relied on linguistic features, cyberbullying on social media is not characterized by linguistic features alone. In this paper, we develop a holistic multi-dimensional feature set that takes into account individual-based, social network-based, episode-based, and linguistic content-based cyberbullying features. To test the performance of the proposed feature set, we designed and built cyberbullying detection models on the KNIME machine learning platform. Six machine learning algorithms (Naïve Bayes, Decision Tree, Random Forest, Tree Ensemble, Logistic Regression, and Support Vector Machines) were used in these models. Our experimental results demonstrate that applying the proposed multi-dimensional feature set (i.e., a set not limited to linguistic features) improves cyberbullying detection for all tested machine learning algorithms.
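
As a rough illustration of what combining linguistic and non-linguistic features might look like, the sketch below builds one feature dictionary per media session. It is not the authors' feature set or their KNIME workflow: the `Session` record, the tiny `PROFANITY` lexicon, and every feature name are hypothetical, chosen only to show one feature per dimension named in the abstract.

```python
from dataclasses import dataclass, field

# Hypothetical lexicon for illustration; a real system would load a curated list.
PROFANITY = {"idiot", "loser", "stupid"}


@dataclass
class Session:
    """One media session: a post owner plus the comments on the post.

    All fields are assumptions made for this sketch.
    """
    owner_followers: int
    owner_following: int
    comments: list = field(default_factory=list)  # list of (author, text) tuples


def linguistic_features(session):
    """Linguistic content-based: share of comment tokens found in the lexicon."""
    tokens = [
        token.strip(".,!?").lower()
        for _, text in session.comments
        for token in text.split()
    ]
    profane = sum(1 for token in tokens if token in PROFANITY)
    return {"profanity_ratio": profane / len(tokens) if tokens else 0.0}


def non_linguistic_features(session):
    """Individual-, social network-, and episode-based features (illustrative)."""
    authors = [author for author, _ in session.comments]
    return {
        # Individual-based: audience size of the post owner.
        "owner_followers": session.owner_followers,
        # Social network-based: followers relative to accounts followed.
        "follow_ratio": session.owner_followers / max(session.owner_following, 1),
        # Episode-based: size of the comment thread and how many people took part.
        "num_comments": len(authors),
        "distinct_commenters": len(set(authors)),
    }


def feature_vector(session, include_non_linguistic=True):
    """Return linguistic features, optionally extended with the other dimensions."""
    features = linguistic_features(session)
    if include_non_linguistic:
        features.update(non_linguistic_features(session))
    return features
```

Calling `feature_vector(session, include_non_linguistic=False)` yields the linguistic-only baseline, so the same classifier can be trained on both variants and the two compared, which mirrors the kind of comparison the abstract describes.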

Author information

Corresponding author

Correspondence to Pavol Zavarsky.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Liu, Y., Zavarsky, P., Malik, Y. (2019). Non-linguistic Features for Cyberbullying Detection on a Social Media Platform Using Machine Learning. In: Vaidya, J., Zhang, X., Li, J. (eds) Cyberspace Safety and Security. CSS 2019. Lecture Notes in Computer Science, vol 11982. Springer, Cham. https://doi.org/10.1007/978-3-030-37337-5_31

  • DOI: https://doi.org/10.1007/978-3-030-37337-5_31

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37336-8

  • Online ISBN: 978-3-030-37337-5

  • eBook Packages: Computer Science, Computer Science (R0)