Published September 1, 2020 | Version 1.0
Dataset Open

Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms

  • 1. University of Piraeus

Description

This repository contains a dataset for the research of domain generation algorithms (DGAs) and machine learning. More precisely, it targets dictionary-based DGAs.

Constantinos Patsakis, Fran Casino: "Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms", Journal of Information Security and Applications, 2021.

Features ordered as in the shared dataset:

  • Family: DGA that the domain belongs to
  • SLD: SLD of the Domain
  • L-LEN: The length of Domain
  • L-DIG: The number of digits in Domain
  • L-CON-MAX: The maximum number of consecutive consonants Domain
  • R-CON-VOW: Number of consonants divided by L-LEN 
  • L-SYM: The number of special characters
  • R-SYM-LEN: L-SYM divided by L-LEN
  • R-Dom-3G: Ratio of benign grams in Dom-3G
  • R-Dom-4G: Ratio of benign grams in Dom-4G
  • R-Dom-5G: Ratio of benign grams in Dom-5G
  • L-W2: Number of words with more than 2 characters in Domain
  • L-W3: Number of words with more than 3 characters in Domain
  • R-WS-LEN: Dom-WS divided by L-LEN
  • R-WDS-LEN: Dom-WDS divided by L-LEN
  • R-W2-LEN: Dom-W2 divided by L-LEN
  • R-W3-LEN: Dom-W3 divided by L-LEN
  • M2-Dom-Ws: 2-Chain Markov English grams applied to Dom-WS
  • M2-Dom-WDS: 2-Chain Markov English grams applied Dom-WDS
  • E-Dom-WS: Entropy of Dom-WS 
  • E-Dom-WDS: Entropy of Dom-WDS
  • E-Dom-W2: Entropy of Dom-W2
  • E-Dom-W3: Entropy of Dom-W3

Files

dictionary_DGAs_dataset.zip

Files (52.2 MB)

Name Size Download all
md5:92cd328d57a2ea5126eac1c1ef19a179
52.2 MB Preview Download

Additional details

Funding

LOCARD – Lawful evidence collecting and continuity platform development 832735
European Commission
YAKSHA – Cybersecurity Awareness and Knowledge Systemic High-level Application 780498
European Commission
CyberSec4Europe – Cyber Security Network of Competence Centres for Europe 830929
European Commission