Dataset Open Access

Dataset used for HTTPS traffic classification using packet burst statistics

Tropkova Zdena; Hynek Karel; Cejka Tomas

We are publishing a dataset we created for the HTTPS traffic classification.

Since the data were captured mainly in the real backbone network, we omitted IP addresses and ports. The datasets consist of calculated from bidirectional flows exported with flow probe Ipifixprobe. This exporter can export a sequence of packet lengths and times and a sequence of packet bursts and time. For more information, please visit ipfixprobe repository (Ipifixprobe).

 

During our research, we divided HTTPS into five categories: L -- Live Video Streaming, P -- Video Player, M -- Music Player, U -- File Upload, D -- File Download, W -- Website, and other traffic.

We have chosen the service representatives known for particular traffic types based on the Alexa Top 1M list and Moz's list of the most popular 500 websites for each category.  We also used several popular websites that primarily focus on the audience in our country. The identified traffic classes and their representatives are provided below:

  • Live Video Stream Twitch, Czech TV, YouTube Live
  • Video Player DailyMotion, Stream.cz, Vimeo, YouTube
  • Music Player AppleMusic, Spotify, SoundCloud
  • File Upload/Download FileSender, OwnCloud, OneDrive, Google Drive
  • Website and Other Traffic Websites from Alexa Top 1M list

 

 


 

 

Files (189.0 MB)
Name Size
HTTPS-clf-dataset.csv
md5:4272a6d36d6a6db7a78fb17a78315db2
189.0 MB Download
99
96
views
downloads
All versions This version
Views 9999
Downloads 9696
Data volume 18.1 GB18.1 GB
Unique views 7373
Unique downloads 5656

Share

Cite as