Non-linguistic Features for Cyberbullying Detection on a Social Media Platform Using Machine Learning

Liu, YuYi; Zavarsky, Pavol; Malik, Yasir

doi:10.1007/978-3-030-37337-5_31

Part of the book series: Lecture Notes in Computer Science (LNSC, volume 11982)

Included in the following conference series:

International Symposium on Cyberspace Safety and Security

Abstract

Cyberbullying on social media platforms is a severe problem with serious negative consequences. A number of studies on automatic detection of cyberbullying using machine learning techniques have therefore been conducted in recent years. While cyberbullying detection has traditionally relied on linguistic features, cyberbullying on social media is not characterized by linguistic features alone. In this paper, a holistic multi-dimensional feature set is developed which takes into account individual-based, social network-based, episode-based and linguistic content-based cyberbullying features. To test the performance of the proposed multi-dimensional feature set, we designed and built cyberbullying detection models on the KNIME machine learning platform. Six different machine learning algorithms - Naïve Bayes, Decision Tree, Random Forest, Tree Ensemble, Logistic Regression, and Support Vector Machines - were used in our cyberbullying detection models. Our experimental results demonstrate that applying the proposed multi-dimensional feature set (i.e. the set not limited to linguistic features) improves cyberbullying detection for all tested machine learning algorithms.
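As a rough illustration of what a multi-dimensional feature set might look like in practice, the sketch below combines one linguistic feature with several non-linguistic ones for a single comment thread. It is not the authors' implementation (the paper's models were built in KNIME). The `Session` record, the tiny `PROFANITY` lexicon, every feature name, and the assignment of features to the individual-based, social network-based and episode-based categories are assumptions made for this example only.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical three-word lexicon, for illustration only.
PROFANITY = {"idiot", "loser", "stupid"}


@dataclass
class Session:
    """Hypothetical record: one post together with its comment thread."""
    comments: List[str]        # comment texts
    commenter_ids: List[str]   # author id of each comment
    owner_followers: int       # assumed individual-based attribute
    owner_following: int       # assumed social network-based attribute
    duration_hours: float      # assumed episode-based attribute


def linguistic_features(s: Session) -> Dict[str, float]:
    """Share of words in the thread that appear in the lexicon."""
    words = [w.strip(".,!?").lower() for c in s.comments for w in c.split()]
    profane = sum(w in PROFANITY for w in words)
    return {"profanity_ratio": profane / len(words) if words else 0.0}


def non_linguistic_features(s: Session) -> Dict[str, float]:
    """Assumed individual-, network- and episode-based features."""
    n = len(s.comments)
    return {
        "owner_followers": float(s.owner_followers),
        "follow_ratio": s.owner_followers / max(s.owner_following, 1),
        "n_comments": float(n),
        "unique_commenters": float(len(set(s.commenter_ids))),
        "comments_per_hour": n / s.duration_hours if s.duration_hours > 0 else 0.0,
    }


def feature_vector(s: Session) -> Dict[str, float]:
    """Merge both groups into one multi-dimensional feature vector."""
    return {**linguistic_features(s), **non_linguistic_features(s)}
```

A vector of this kind could then be passed to any of the classifiers named in the abstract; comparing a model trained on `linguistic_features` alone against one trained on `feature_vector` mirrors the kind of comparison the paper reports.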

Author information

Authors and Affiliations

Edmonton Public Schools, Edmonton, Canada
YuYi Liu
Concordia University of Edmonton, Edmonton, Canada
Pavol Zavarsky & Yasir Malik

Corresponding author

Correspondence to Pavol Zavarsky.

Editor information

Editors and Affiliations

Rutgers University, Newark, NJ, USA
Jaideep Vaidya
Beihang University, Beijing, China
Xiao Zhang
Guangzhou University, Guangzhou, China
Jin Li

About this paper

Cite this paper

Liu, Y., Zavarsky, P., Malik, Y. (2019). Non-linguistic Features for Cyberbullying Detection on a Social Media Platform Using Machine Learning. In: Vaidya, J., Zhang, X., Li, J. (eds) Cyberspace Safety and Security. CSS 2019. Lecture Notes in Computer Science, vol 11982. Springer, Cham. https://doi.org/10.1007/978-3-030-37337-5_31

DOI: https://doi.org/10.1007/978-3-030-37337-5_31
Published: 03 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37336-8
Online ISBN: 978-3-030-37337-5
eBook Packages: Computer Science, Computer Science (R0)
