BERT-BASED DETECTION OF CYBERBULLYING IN ONLINE TEXTS

Authors: Amrutha Muralidhar
Affiliation: B. M. S College of Engineering, Bangalore

Category:

Keywords: Cyberbullying, Online Safety, Sentiment Analysis, Deep Learning, Text Classification
ABSTRACT. Social media has experienced exponential growth in recent years, becoming integral to daily communication and interaction. However, along with this growth, cyberbullying has emerged as a significant issue, causing harm and distress to individuals online. This paper investigates the effectiveness of utilizing BERT-based models for identifying cyberbullying behavior in online text. A BERT classifier was trained on a labeled dataset containing instances of cyberbullying and assessed for its performance in accurately detecting such behavior. Results indicate that the BERT classifier achieves a strong accuracy rate of 94% on the test dataset. These findings suggest the potential of BERT-based models in bolstering online safety efforts and combating cyberbullying. The aim of this study is to contribute to the advancement of tools aimed at fostering digital well-being and cultivating safer online communities

References:

Ani Petrosyan. 2024. “Worldwide Digital Population 2024.” Statista. May 7, 2024. Accessed May 8, 2024. https://www.statista.com/statistics/617136/digital-population-worldwide/
Auxier, Brooke, and Monica Anderson. "Social media use in 2021." Pew Research Center 1, no. 1 (2021): 1-4
Craig, Wendy, Meyran Boniel-Nissim, Nathan King, Sophie D. Walsh, Maartje Boer, Peter D. Donnelly, Yossi Harel-Fisch et al. "Social media use and cyber-bullying: A cross-national analysis of young people in 42 countries." Journal of Adolescent Health 66, no. 6 (2020): S100-S108. https://doi.org/10.1016/j.jadohealth.2020.03.006
Horner, Stacy, Yvonne Asher, and Gary D. Fireman. "The impact and response to electronic bullying and traditional bullying among adolescents." Computers in human behavior 49 (2015): 288-295. https://doi.org/10.1016/j.chb.2015.03.007
Camerini, Anne-Linda, Laura Marciano, Anna Carrara, and Peter Schulz. ‘Cyberbullying Perpetration and Victimization among Children and Adolescents: A Systematic Review of Longitudinal Studies’. Telematics and Informatics 49 (06 2020): 101362. https://doi.org/10.1016/j.tele.2020.101362
Calpbinici, Pelin, and Fatma Tas Arslan. "Virtual behaviors affecting adolescent mental health: The usage of Internet and mobile phone and cyberbullying." Journal of Child and Adolescent Psychiatric Nursing 32, no. 3 (2019): 139-148
United Nations Children’s Fund (UNICEF). 2020. “Children at Increased Risk of Harm Online During Global COVID-19 Pandemic.” Unicef.Org. April 14, 2020. Accessed May 8, 2024. https://www.unicef.org/press-releases/children-increased-risk-harm-online-during-global-covid-19-pandemic
Ganson, Kyle T., Nelson Pang, Jason M. Nagata, Catrin Pedder Penn-Jones, Faye Mishna, Alexander Testa, Dylan B. Jackson, and David Hammond. 2024. “Screen Time, Social Media Use, and Weight-related Bullying Victimization: Findings From an International Sample of Adolescents.” PloS One 19 (4): e0299830. https://doi.org/10.1371/journal.pone.0299830
Chavan, V. S., & Shylaja, S. S. "Machine learning approach for detection of cyber-aggressive comments by peers on social media network." In 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 2354-2358. Kochi, India, 2015. DOI: 10.1109/ICACCI.2015.7275970
Chen, Y., Zhou, Y., Zhu, S., & Xu, H. "Detecting Offensive Language in Social Media to Protect Adolescent Online Safety." In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing, pp. 71-80. Amsterdam, Netherlands, 2012. DOI: 10.1109/SocialCom-PASSAT.2012.55
Özel, S. A., Saraç, E., Akdemir, S., & Aksu, H. "Detection of cyberbullying on social media messages in Turkish." In 2017 International Conference on Computer Science and Engineering (UBMK), pp. 366-370. Antalya, Turkey, 2017. DOI: 10.1109/UBMK.2017.8093411
Yadav, J., Kumar, D., & Chauhan, D. "Cyberbullying Detection using Pre-Trained BERT Model." In 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), pp. 1096-1100. Coimbatore, India, 2020. DOI: 10.1109/ICESC48915.2020.9155700
Basak, R., Sural, S., Ganguly, N., & Ghosh, S. K. "Online Public Shaming on Twitter: Detection, Analysis, and Mitigation." In IEEE Transactions on Computational Social Systems, vol. 6, no. 2, pp. 208-220, April 2019. DOI: 10.1109/TCSS.2019.2895734
Watanabe, H., Bouazizi, M., & Ohtsuki, T. "Hate Speech on Twitter: A Pragmatic Approach to Collect Hateful and Offensive Expressions and Perform Hate Speech Detection." In IEEE Access, vol. 6, pp. 13825-13835, 2018. DOI: 10.1109/ACCESS.2018.2806394
Roy, P. K., Tripathy, A. K., Das, T. K., & Gao, X.-Z. "A Framework for Hate Speech Detection Using Deep Convolutional Neural Network." In IEEE Access, vol. 8, pp. 204951-204962, 2020. DOI: 10.1109/ACCESS.2020.3037073
Martins, R., Gomes, M., Almeida, J. J., Novais, P., & Henriques, P. "Hate Speech Classification in Social Media Using Emotional Analysis." In 2018 7th Brazilian Conference on Intelligent Systems (BRACIS), pp. 61-66. Sao Paulo, Brazil, 2018. DOI: 10.1109/BRACIS.2018.00019
Alam, K. S., Bhowmik, S., & Prosun, P. R. K. (2021). Cyberbullying Detection: An Ensemble Based Machine Learning Approach. In 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV) (pp. 710-715). Tirunelveli, India. DOI: 10.1109/ICICV50876.2021.9388499
Rodríguez, A., Argueta, C., & Chen, Y.-L. (2019). Automatic Detection of Hate Speech on Facebook Using Sentiment and Emotion Analysis. In 2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) (pp. 169-174). Okinawa, Japan. DOI: 10.1109/ICAIIC.2019.8669073
Zhou, Y., Yang, Y., Liu, H., Liu, X., & Savage, N. (2020). Deep Learning Based Fusion Approach for Hate Speech Detection. IEEE Access, 8, 128923-128929. DOI: 10.1109/ACCESS.2020.3009244
Akter, M. S., Shahriar, H., Ahmed, N., & Cuzzocrea, A. (2022). Deep Learning Approach for Classifying Aggressive Comments on Social Media: Machine Translated Data Vs Real Life Data. 2022 IEEE International Conference on Big Data (Big Data), 5646-5655. doi: 10.1109/BigData55660.2022.10020249