Published October 27, 2022 | Version v1
Dataset Open

UNSW-NB15 and CIC-IDS2017 Labelled PCAP Data

  • 1. Texas A&M University
  • 2. United States Military Academy

Description

Packet Capture (PCAP) files of UNSW-NB15 and CIC-IDS2017 dataset are processed and labelled utilizing the CSV files. Each packet is labelled by comparing the eight distinct features: *Source IP, Destination IP, Source Port, Destination Port, Starting time, Ending time, Protocol and Time to live*.  The dimensions for the dataset is Nx1504. All column of the dataset are integers, therefore you can directly utilize this dataset in you machine learning models. Moreover, details of the whole processing and transformation is provided in the following GitHub Repo:  

https://github.com/Yasir-ali-farrukh/Payload-Byte

You can utilize the tool available at the above mentioned GitHub repo to generate labelled dataset from scratch. All of the detail of processing and transformation is provided in the following paper: 

 ```yaml
@article{Payload,  
author = "Yasir Ali Farrukh and Irfan Khan and Syed Wali and David Bierbrauer and Nathaniel Bastian",  
title = "{Payload-Byte: A Tool for Extracting and Labeling Packet Capture Files of Modern Network Intrusion Detection Datasets}",  
year = "2022",  
month = "9",  
url = "https://www.techrxiv.org/articles/preprint/Payload-Byte_A_Tool_for_Extracting_and_Labeling_Packet_Capture_Files_of_Modern_Network_Intrusion_Detection_Datasets/20714221",  
doi = "10.36227/techrxiv.20714221.v1"  
}
 

Files

Payload_data_CICIDS2017.csv

Files (5.2 GB)

Name Size Download all
md5:6344cc6af3e086cf0fe21af4299e9513
4.9 GB Preview Download
md5:17ae7b2540192938833d47a8ec9d077c
270.1 MB Preview Download

Additional details

Related works

Describes
Preprint: 10.36227/techrxiv.20714221.v1 (DOI)