4010620
doi
10.5281/zenodo.4010620
oai:zenodo.org:4010620
Fran Casino
University of Piraeus
Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms
Constantinos Patsakis
University of Piraeus
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
DGAs
<p>This repository contains a dataset for the research of domain generation algorithms (DGAs) and machine learning. More precisely, it targets dictionary-based DGAs.</p>
<p><em>Constantinos Patsakis, Fran Casino: "Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms", Journal of Information Security and Applications, 2021.</em></p>
<p>Features ordered as in the shared dataset:</p>
<ul>
<li>Family: DGA that the domain belongs to</li>
<li>SLD: SLD of the Domain</li>
<li>L-LEN: The length of Domain</li>
<li>L-DIG: The number of digits in Domain</li>
<li>L-CON-MAX: The maximum number of consecutive consonants Domain</li>
<li>R-CON-VOW: Number of consonants divided by L-LEN </li>
<li>L-SYM: The number of special characters</li>
<li>R-SYM-LEN: L-SYM divided by L-LEN</li>
<li>R-Dom-3G: Ratio of benign grams in Dom-3G</li>
<li>R-Dom-4G: Ratio of benign grams in Dom-4G</li>
<li>R-Dom-5G: Ratio of benign grams in Dom-5G</li>
<li>L-W2: Number of words with more than 2 characters in Domain</li>
<li>L-W3: Number of words with more than 3 characters in Domain</li>
<li>R-WS-LEN: Dom-WS divided by L-LEN</li>
<li>R-WDS-LEN: Dom-WDS divided by L-LEN</li>
<li>R-W2-LEN: Dom-W2 divided by L-LEN</li>
<li>R-W3-LEN: Dom-W3 divided by L-LEN</li>
<li>M2-Dom-Ws: 2-Chain Markov English grams applied to Dom-WS</li>
<li>M2-Dom-WDS: 2-Chain Markov English grams applied Dom-WDS</li>
<li>E-Dom-WS: Entropy of Dom-WS </li>
<li>E-Dom-WDS: Entropy of Dom-WDS</li>
<li>E-Dom-W2: Entropy of Dom-W2</li>
<li>E-Dom-W3: Entropy of Dom-W3</li>
</ul>
Zenodo
2020-09-01
info:eu-repo/semantics/other
4010619
1.0
award_title=Lawful evidence collecting and continuity platform development; award_number=832735; award_identifiers_scheme=url; award_identifiers_identifier=https://cordis.europa.eu/projects/832735; funder_id=00k4n6c32; funder_name=European Commission;
award_title=Cybersecurity Awareness and Knowledge Systemic High-level Application; award_number=780498; award_identifiers_scheme=url; award_identifiers_identifier=https://cordis.europa.eu/projects/780498; funder_id=00k4n6c32; funder_name=European Commission;
award_title=Cyber Security Network of Competence Centres for Europe; award_number=830929; award_identifiers_scheme=url; award_identifiers_identifier=https://cordis.europa.eu/projects/830929; funder_id=00k4n6c32; funder_name=European Commission;
1608581981.85421
52219491
md5:92cd328d57a2ea5126eac1c1ef19a179
https://zenodo.org/records/4010620/files/dictionary_DGAs_dataset.zip
public
10.5281/zenodo.4010619
isVersionOf
doi