Dataset Open Access

Dataset used for fingerprinting of DNS over HTTPS responses.

Hynek, Karel; Cejka, Tomas

The dataset consists of multiple data sources:

  1. DoH-enabled Firefox on Linux OS
  2. DoH-enabled Firefox on Windows 10 OS
  3. DoH-enabled Chrome on Windows 10 OS

We captured the traffic from the DoH-enabled web browsers using tcpdump. To automate traffic generation, we installed Google Chrome and Mozilla Firefox in separate virtual machines and controlled them with the Selenium framework ([…] shows detailed information about the browsers and environments used). Selenium simulates a user's browsing according to a predefined script and a list of domain names (in our case, URLs from Alexa's top websites list). Selenium was configured to visit the pages in random order multiple times. We captured the traffic with each browser's default settings. We did not disable the browser's DNS cache, and the random order of page visits ensures that the dataset contains traces influenced by DNS caching mechanisms. Each virtual machine was configured to export TLS cryptographic keys, which were used to decrypt the traffic with Wireshark.
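The randomized visiting described above can be illustrated with a minimal Python sketch. It is not the authors' script: the URLs, the repeat count, and the helper names `build_visit_schedule` and `run_schedule` are hypothetical, and shuffling one combined list is only one possible reading of "random order multiple times". Any object with a `get(url)` method, such as a Selenium WebDriver, can be passed as `driver`.

```python
import random


def build_visit_schedule(urls, repeats, seed=None):
    """Return every URL `repeats` times, shuffled into one random order."""
    rng = random.Random(seed)
    schedule = [url for url in urls for _ in range(repeats)]
    rng.shuffle(schedule)
    return schedule


def run_schedule(driver, schedule):
    """Load each page in turn.

    `driver` is any object with a get(url) method, for example a
    Selenium WebDriver instance.
    """
    for url in schedule:
        driver.get(url)


# Hypothetical inputs: placeholder URLs and repeat count, not the Alexa list.
example_urls = ["https://example.com", "https://example.org", "https://example.net"]
example_schedule = build_visit_schedule(example_urls, repeats=3, seed=0)
```

Because repeated visits to the same domain are interleaved with visits to other domains, some lookups are likely to be answered from the browser's DNS cache, which is the effect the description attributes to the random ordering.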

The Wireshark text output of the decrypted traffic is provided in the dataset files. Detailed information about each file is provided in the dataset README.
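For readers who want to produce comparable text output from their own captures, the sketch below builds a command for tshark, Wireshark's command-line tool. This is an assumption about one workable approach, not the authors' documented export procedure: the file names are hypothetical, and the sketch relies on the `tls.keylog_file` preference of recent Wireshark releases to point the dissector at an exported TLS key log.

```python
import shlex


def tshark_decrypt_command(pcap_path, keylog_path):
    """Build a tshark command that prints decrypted packet details as text."""
    return [
        "tshark",
        "-r", pcap_path,                         # read packets from a capture file
        "-o", f"tls.keylog_file:{keylog_path}",  # TLS key log used for decryption
        "-V",                                    # print full packet details as text
    ]


# Hypothetical file names, for illustration only.
cmd = tshark_decrypt_command("capture.pcap", "tls_keys.log")
print(shlex.join(cmd))
```

The returned list can be passed to `subprocess.run(cmd, capture_output=True, text=True)` on a machine where tshark is installed.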

Files (105.9 MB)
Name                               Size
DoH-fingerprinting-dataset.tar.gz  105.9 MB
md5:64d1d25b5ada86bf77faf95d8159b688
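The listed md5 checksum can be used to confirm that the archive downloaded intact. A minimal Python sketch follows; the helper names `md5_of_file` and `archive_is_intact` are illustrative, and the archive is assumed to be saved locally under its listed name.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Checksum as listed for DoH-fingerprinting-dataset.tar.gz.
EXPECTED_MD5 = "64d1d25b5ada86bf77faf95d8159b688"


def archive_is_intact(path, expected=EXPECTED_MD5):
    """Return True when the file's md5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Calling `archive_is_intact("DoH-fingerprinting-dataset.tar.gz")` after the download should return `True` when the file matches the published checksum.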