Classification of IT Support Tickets
Description
Collection of 2229 support tickets manually classified into 7 categories, obtained from a IT support company in the Florianópolis (Brazil) region. Each ticket is represented by an unstructured text field, which is typed by the user that opened the call. The classification process was performed in 2020 by three IT support professionals. The corpus contains tickets in many languages, mainly English, German, Portuguese and Spanish.
All Personal Identifiable Information (PII) and sensitive information were removed (substituted by a tag indicating the original content, for instance: the sentence "this text was written by Leonardo" is converted to "this text was written by [NAME]"). The removal was performed in three steps: first, the automated machine learning-based tool AWS Comprehend PII Removal was used; then, a sequence of custom regular expressions was applied; last, the entire corpus was manually verified.
Files
confusion_matrix.png
Files
(1.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:811cee209e221033db06587d7d331dce
|
4.1 kB | Download |
|
md5:99656383ba6f3f11a49493eb5ba5d6ab
|
81.0 kB | Preview Download |
|
md5:f5f206a54b2b2a58c820c03349ba8cd1
|
262 Bytes | Preview Download |
|
md5:185f22256544b811597e697b6a63f79d
|
678 Bytes | Preview Download |
|
md5:aa8a0ad500fedd3bf3fa7fb894b5d168
|
50.9 kB | Preview Download |
|
md5:3e446943a1aef153026b858984ba9b03
|
340 Bytes | Download |
|
md5:db8b341724a6d161a5e9d327f96a2fd5
|
461.1 kB | Download |
|
md5:f9a01485eba452d5b20f83c322345878
|
153.5 kB | Download |
|
md5:02328c4e09a78a41acb22ce17e6e0312
|
164.5 kB | Preview Download |
|
md5:0a79f0c2f95fcdb407f24b2c0ad7f7d1
|
215.0 kB | Preview Download |
|
md5:2f054c0e3d8cbbbd76adf30d13a7710a
|
11.1 kB | Preview Download |
|
md5:90876921486a26a67bc1976da87305d9
|
27.8 kB | Preview Download |