Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
Description
This paper presents a comprehensive comparative analysis of Natural Language Processing (NLP) methods for automated toxicity detection in online gaming chats. Traditional machine learning models with embeddings, large language models (LLMs) with zero-shot and few-shot prompting, fine-tuned transformer models, and retrieval-augmented generation (RAG) approaches are evaluated. The evaluation framework assesses three critical dimensions: classification accuracy, processing speed, and computational costs. A hybrid moderation system architecture is proposed that optimizes human moderator workload through automated detection and incorporates continuous learning mechanisms. The experimental results demonstrate significant performance variations across methods, with fine-tuned DistilBERT achieving optimal accuracy-cost trade-offs. The findings provide empirical evidence for deploying cost-effective, efficient content moderation systems in dynamic online gaming environments.
Files
Efficient_Toxicity_Detection_in_Gaming_Chats.pdf
Files
(395.9 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:cc2b57c4eae85f11605a3ff7ffe41667
|
395.9 kB | Preview Download |
Additional details
Related works
- Is supplement to
- https://aclanthology.org/2024.nlp4dh-1.10/ (URL)
Software
- Repository URL
- https://github.com/Yegmina/toxic-content-detection-agent
- Programming language
- Python
- Development Status
- Active