Dataset Open Access

Dataset used for fingerprinting of DNS over HTTPS responses.

Hynek, Karel; Cejka, Tomas

 The dataset consists of multiple different data sources:

  1. DoH enabled Firefox on Linux OS
  2. DoH enabled Firefox on Windows 10 OS
  3. DoH enabled Chrome on Windows 10 OS

 

We captured the traffic from the DoH enabled web-browsers using tcpdump. To automate the process of traffic generation, we installed Google Chrome and Mozilla Firefox into separate virtual machines and controlled them with the Selenium framework shows detailed information about used browsers and environments). Selenium simulates a user's browsing according to the predefined script and a list of domain names (i.e., URLs from Alexa's top websites list in our case).  The selenium was configured to visit pages in random order multiple times. For capturing the traffic, we used the default settings of each browser. We did not disable the DNS cache of the browser, and the random order of visiting webpages secures that the dataset contains traces influenced by DNS caching mechanisms. Each virtual machine was configured to export TLS cryptographic keys, that was used for decrypting the traffic using WireShark application. 

The WireShark text output of the decrypted traffic is provided in the dataset files. The detailed information about each file is provided in dataset README.

 

 

 

Files (105.9 MB)
Name Size
DoH-fingerprinting-dataset.tar.gz
md5:64d1d25b5ada86bf77faf95d8159b688
105.9 MB Download
135
5
views
downloads
All versions This version
Views 135100
Downloads 55
Data volume 529.7 MB529.7 MB
Unique views 9384
Unique downloads 44

Share

Cite as