Published October 15, 2022 | Version V1
Dataset Open

Offensive content dataset in Urdu language

  • 1. Department of Computer Science, Capital University of Science and Technology, Kahuta Road, Sihala, Islamabad, Pakistan

Description

The archive contains python code and various feature files of offensive language dataset in urdu. The purpose of sharing this archive is to regenerate the results produced by the research article and can extend the findings.

Files

Files (403.0 MB)

Name Size Download all
md5:a276d10815ec6743e1635a1b339ba9ab
228.2 MB Download
md5:66eb967e8856a164f71c4c58243550a2
170.0 MB Download
md5:821e7f583778b242b215b701bfb96c5d
4.7 MB Download
md5:a75f06665c8553c8ee077b553dc2ef2f
3.4 kB Download
md5:64f057fcb3ca08f1b2ca9602dd4bca2e
163.6 kB Download