Published October 29, 2024
| Version v1
Dataset
Open
Cloud Incident Reports (2016-2024)
Description
NOTE: Our work is under review, and this dataset is released for open science purposes.
💾 data/ # Datasets (AWS, AZURE, GCP)
├── 1_raw_data/ # Original incident reports
├── 2_clean_data/ # Processed clean data
├── 3_sample_data/ # Sampled data by K-means clustering
├── 4_label_data/ # Annotated data for evaluation
└── data_process.py # Data process, clean, and sample
| ID | Name | Period | #Rows | #Labeled | Avg.Words |
| 1 | AWS | 2016-2022 | 774 | 150(19%) | 151 |
| 2 | AZURE | 2019-2024 | 127 | 95(75%) | 575 |
| 3 | GCP | 2016-2021 | 2,186 | 215(10%) | 533 |
| TOTAL | TOTAL | 2016-2024 | 3,087 | 460(15%) | - |
Files
data.zip
Files
(5.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:1bb311c9d08c4853a1373e5370231d54
|
5.8 MB | Preview Download |