Published February 28, 2025 | Version CC-BY-NC-ND 4.0
Journal article Open

An Ensemble Learning Framework for Robust Cyberbullying Detection on Social Media

  • 1. Department of Computer Science and Engineering, Osmania University, Hyderabad (Telangana), India.
  • 1. Department of Computer Science and Engineering, Osmania University, Hyderabad (Telangana), India.

Description

Abstract: Social networking platforms on the Internet are now an essential feature of daily life worldwide, as these networks have made bridging the gap and sharing content an effortless task. Twitter stands out as a leading platform with a gigantic user base and is used extensively for communication between people and spreading information. Besides the many advantages these websites offer, such as promoting worldwide communication and dialogue, they may also pose unintended side effects that can be destructive to humanitarian and social life. One of the negative impacts of social networking sites is cyberbullying. Cyberbullying can be defined as “willful and repeated harm inflicted through the medium of electronic text” [1]. The support of harmful actions, such as harassment, threats, and humiliation, by individuals in online environments has brought about significant emotional and psychological effects for targeted individuals. The anonymity associated with social media platforms has the effect of increasing the occurrence of such detrimental activities, as there is less fear of the consequences of their actions, thus escalating the negative impact of cyberbullying. The Cyberbullying Detection Algorithm, a unique research approach, is used to combat the increasing problem of cyberbullying through ensemble-based learning algorithms, achieving a set of features for the Twitter dataset using machine learning techniques. This algorithm will look down on user-generated tweets in real time and discover patterns that may indicate cyberbullying behaviour. The role of the framework is to make the cyberbullying detection model on Internet platforms such as Twitter more accountable and effective through a mix of Machine Learning algorithms such as Random Forest, BERT, LSTM, and Ensemble. Our findings from an evaluative study of the critical features extracted from the Twitter dataset showed their relevance in cyberbullying detection. The performance evaluation based on key metrics such as F1 Score, Accuracy, AUC, and Precision depicts how the detection of cyberbullying can be made more effective and efficient by utilising machine learning algorithms that can detect online harassment and create a secure digital space for everyone.

Files

C456114030225.pdf

Files (745.7 kB)

Name Size Download all
md5:3ad6da493875bb4e6c53385bba39681a
745.7 kB Preview Download

Additional details

Identifiers

Dates

Accepted
2025-02-15
Manuscript received on 30 September 2024 | First Revised Manuscript received on 16 October 2024 | Second Revised Manuscript received on 19 December 2024 | Manuscript Accepted on 15 February 2025 | Manuscript published on 28 February 2025.

References

  • Patchin, Justin & Hinduja, Sameer. (2006). Bullies Move Beyond the Schoolyard A Preliminary Look at Cyberbullying. Youth Violence and Juvenile Justice. 4. 148-169. DOI: https://doi.org/10.1177/1541204006286288
  • Rosa, Hugo & Salgado Pereira, Nádia & Ribeiro, Ricardo & Ferreira, Paula & Carvalho, Joao & Oliveira, Sofia & Coheur, Luisa & Paulino, Paula & Veiga Simão, Ana Margarida & Trancoso, Isabel. (2019). Automatic cyberbullying detection: A systematic review. Computers in Human Behavior. 93. 333-345. DOI: https://doi.org/10.1016/j.chb.2018.12.021
  • K. S. Alam, S. Bhowmik and P. R. K. Prosun, "Cyberbullying Detection: An Ensemble Based Machine Learning Approach," 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, 2021, pp. 710- 715, Doi: https://doi.org/10.1109/ICICV50876.2021.9388499
  • Abdullah, Alqahtani., Mohammad, Ilyas. (2024). A Machine Learning Ensemble Model for the Detection of Cyberbullying. DOI: 10.48550/arxiv.2402.12538 https://doi.org/10.5121/ijaia.2024.15108
  • Pankaj, Shah., Shivali, Chopra. (2024). Mixed Language Text Classification Using Machine Learning: Cyberbullying Detection System. 514-518. DOI: https://doi.org/10.1201/9781003405580-83
  • Jinan, Redha, Mutar. (2024). Cyberbullying Messages Detection Using Machine Learning and Deep Learning. International journal of advances in scientific research and engineering, 10(03):19-29. DOI: https://doi.org/10.31695/IJASRE.2024.3.3
  • K. S. Raj, K. Tej, N. K. S, S. K. T and S. Vajipayajula, "Ensemble Techniques for Malicious Threat Detection," 2024 International Conference on Inventive Computation Technologies (ICICT), Lalitpur, Nepal, 2024, pp. 1543-1545, DOI: https://doi.org/10.1109/ICICT60155.2024.10544694
  • Prasad, K. L., Anusha, P., Rao, M., & Rao, Dr. K. V. (2019). A Machine Learning based Preventing the Occurrence of Cyber Bullying Messages on OSN. In International Journal of Recent Technology and Engineering (IJRTE) (Vol. 8, Issue 3, pp. 1861–1865). DOI: https://doi.org/10.35940/ijrte.a9164.078219
  • Jalda, C.S., Polimetla, U.B., Nanda, A.K., Nanda, S. (2024). A Comparison Study of Cyberbullying Detection Using Various Machine Learning Algorithms. In: Sathees kumaran, S., Zhang, Y., Balas, V.E., Hong, Tp., Pelusi, D. (eds) Intelligent Computing for Sustainable Development. ICICSD 2023. Communications in Computer and Information Science, vol 2122. Springer, Cham. DOI: https://doi.org/10.1007/978-3-031-61298-5_4
  • Bhagyashree, Kadam. (2023). Cyberbullying Detection using Machine Learning Algorithms. International Journal For Science Technology And Engineering, 11(5):1326-1328. DOI: https://doi.org/10.22214/ijraset.2023.51749
  • Muneer A, Alwadain A, Ragab MG, Alqushaibi A. Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT. Information. 2023; 14(8):467. DOI: https://doi.org/10.3390/info14080467
  • Ali, A., & Syed, A. M. (2022). Cyberbullying Detection using Machine Learning. Pakistan Journal of Engineering and Technology, 3(2), 45–50. DOI: https://doi.org/10.51846/vol3iss2pp45-50
  • Patil, P., Raul, S., Raut, D., & Nagarhalli, T. (2023). Hate Speech Detection using Deep Learning and Text Analysis. 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), 322–330. DOI: https://doi.org/10.1109/iciccs56967.2023.10142895
  • Hondor, Saragih., Jonson, Manurung. (2024). 1. Leveraging the BERT Model for Enhanced Sentiment Analysis in Multicontextual Social Media Content. Jurnal Manajemen Informatika C.I.T. Medicom, DOI: https://doi.org/10.35335/cit.Vol16.2024.766.pp82-89
  • Amisha, Sharma., Diya, Khajuria., Ayushi., Ritu, Rani., Garima, Jaiswal., Mala, Saraswat. (2023). LSTM-Based Model for Classification of Tweets. 1-7. DOI: https://doi.org/10.1109/ASIANCON58793.2023.10270665
  • Yamaguchi, A., Margatina, K., Chrysostomou, G., & Αλέτρας, Ν. (2021). Frustratingly Simple Pretraining Alternatives to Masked Language Modeling. cornell university. DOI: https://doi.org/10.48550/arxiv.2109.01819
  • Sun, Y., Hao, C., Zheng, Y., & Qiu, H. (2021). NSP-BERT: A Promptbased Few-Shot Learner Through an Original Pre-training Task--Next Sentence Prediction. cornell university. DOI: https://doi.org/10.48550/arxiv.2109.03564
  • Devlin, J., Chang, M., Lee, K., & Toutanova, K. (2018). BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding. arXiv (Cornell University). DOI: https://doi.org/10.48550/arxiv.1810.04805
  • Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780. DOI: https://doi.org/10.1162/neco.1997.9.8.1735
  • Witten, I. H., Frank, E., Hall, M. A., & Pal, C. J. (2017). Ensemble learning. In Elsevier eBooks (pp. 479–501). DOI: https://doi.org/10.1016/b978-0-12-804291-5.00012-x
  • Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017b, June 12). Attention Is All You Need. arXiv.org. https://arxiv.org/abs/1706.03762
  • Hoque, M. N., & Seddiqui, M. H. (2024). Detecting cyberbullying text using the approaches with machine learning models for the low-resource Bengali language. IAES International Journal of Artificial Intelligence, 13(1), 358. DOI: https://doi.org/10.11591/ijai.v13.i1.pp358-367
  • Chen, S., He, K., & Wang, J. (2024). Chinese Cyberbullying Detection Using XLNet and Deep Bi-LSTM Hybrid Model. Information, 15(2), 93. DOI: https://doi.org/10.3390/info15020093
  • Shibly, F. H. A., Sharma, U., & Naleer, H. M. M. (2022). Performance Comparison of Machine Learning and Deep Learning Algorithms in Detecting Online Hate Speech (pp. 695–706). Springer Nature Singapore. DOI: https://doi.org/10.1007/978-981-19-2821-5_59
  • Farasalsabila, F., Utami, E., & Hanafi, H. (2024). Deteksi Cyberbullying Menggunakan BERT dan Bi-LSTM. Jurnal Teknologi, 17(1). DOI: https://doi.org/10.34151/jurtek.v17i1.4636
  • Sunitharam, Dr. C., Nandini, P. S., & K, R. (2023). Detection of CyberBullying Through Sentimental Analysis. In International Journal of Soft Computing and Engineering (Vol. 13, Issue 1, pp. 16–20). DOI: https://doi.org/10.35940/ijsce.a3594.0313123
  • Angelis, J. D., & Perasso, G. (2020). Cyberbullying Detection Through Machine Learning: Can Technology Help to Prevent Internet Bullying? In International Journal of Management and Humanities (Vol. 4, Issue 11, pp. 57–69). DOI: https://doi.org/10.35940/ijmh.k1056.0741120
  • Sharma, P. (2023). Advancements in OCR: A Deep Learning Algorithm for Enhanced Text Recognition. In International Journal of Inventive Engineering and Sciences (Vol. 10, Issue 8, pp. 1–7). DOI: https://doi.org/10.35940/ijies.f4263.0810823
  • Prashar, S., & Bhakar, S. (2019). Real Time Cyberbullying Detection. In International Journal of Engineering and Advanced Technology (Vol. 9, Issue 2, pp. 5197–5201). DOI: https://doi.org/10.35940/ijeat.b4253.129219