Published December 1, 2022 | Version v2
Dataset Open

Classification of IT Support Tickets

  • 1. Federal Institute of Santa Catarina

Description

Collection of 2229 support tickets manually classified into 7 categories, obtained from a IT support company in the Florianópolis (Brazil) region. Each ticket is represented by an unstructured text field, which is typed by the user that opened the call. The classification process was performed in 2020 by three IT support professionals. The corpus contains tickets in many languages, mainly English, German, Portuguese and Spanish.

All Personal Identifiable Information (PII) and sensitive information were removed (substituted by a tag indicating the original content, for instance: the sentence "this text was written by Leonardo" is converted to "this text was written by [NAME]"). The removal was performed in three steps: first, the automated machine learning-based tool AWS Comprehend PII Removal was used; then, a sequence of custom regular expressions was applied; last, the entire corpus was manually verified.

 

Files

confusion_matrix.png

Files (1.2 MB)

Name Size Download all
md5:811cee209e221033db06587d7d331dce
4.1 kB Download
md5:99656383ba6f3f11a49493eb5ba5d6ab
81.0 kB Preview Download
md5:f5f206a54b2b2a58c820c03349ba8cd1
262 Bytes Preview Download
md5:185f22256544b811597e697b6a63f79d
678 Bytes Preview Download
md5:aa8a0ad500fedd3bf3fa7fb894b5d168
50.9 kB Preview Download
md5:3e446943a1aef153026b858984ba9b03
340 Bytes Download
md5:db8b341724a6d161a5e9d327f96a2fd5
461.1 kB Download
md5:f9a01485eba452d5b20f83c322345878
153.5 kB Download
md5:02328c4e09a78a41acb22ce17e6e0312
164.5 kB Preview Download
md5:0a79f0c2f95fcdb407f24b2c0ad7f7d1
215.0 kB Preview Download
md5:2f054c0e3d8cbbbd76adf30d13a7710a
11.1 kB Preview Download
md5:90876921486a26a67bc1976da87305d9
27.8 kB Preview Download