Published June 8, 2021 | Version v1
Dataset Open

Dataset used for HTTPS traffic classification using packet burst statistics

  • 1. FIT CTU
  • 2. FIT CTU & CESNET
  • 3. CESNET

Description

We are publishing a dataset we created for the HTTPS traffic classification.

Since the data were captured mainly in the real backbone network, we omitted IP addresses and ports. The datasets consist of calculated from bidirectional flows exported with flow probe Ipifixprobe. This exporter can export a sequence of packet lengths and times and a sequence of packet bursts and time. For more information, please visit ipfixprobe repository (Ipifixprobe).

 

During our research, we divided HTTPS into five categories: L -- Live Video Streaming, P -- Video Player, M -- Music Player, U -- File Upload, D -- File Download, W -- Website, and other traffic.

We have chosen the service representatives known for particular traffic types based on the Alexa Top 1M list and Moz's list of the most popular 500 websites for each category.  We also used several popular websites that primarily focus on the audience in our country. The identified traffic classes and their representatives are provided below:

  • Live Video Stream Twitch, Czech TV, YouTube Live
  • Video Player DailyMotion, Stream.cz, Vimeo, YouTube
  • Music Player AppleMusic, Spotify, SoundCloud
  • File Upload/Download FileSender, OwnCloud, OneDrive, Google Drive
  • Website and Other Traffic Websites from Alexa Top 1M list

 

 


 

 

Files

HTTPS-clf-dataset.csv

Files (189.0 MB)

Name Size Download all
md5:4272a6d36d6a6db7a78fb17a78315db2
189.0 MB Preview Download

Additional details

Funding

SAPPAN – Sharing and Automation for Privacy Preserving Attack Neutralization 833418
European Commission