There is a newer version of the record available.

Published February 20, 2025 | Version v1
Dataset Open

A Dataset of Windows System Binaries and Similarity Digests for Enhanced Forensic Analysis

  • 1. ROR icon Universidad de Zaragoza

Contributors

Researcher:

Description

This dataset contains the results of a static analysis performed on system binary files extracted from multiple versions of the Windows operating system, including 64-bit versions of Windows 7, Windows 8.1, Windows Server 2008 R2, Windows Server 2012, Windows Server 2016, and Windows Server 2019. An automated static analysis process decomposed each binary into individual functions and extracted detailed metadata. Similarity digest algorithms, such as TLSH and ssdeep, were applied to generate unique fingerprints of these function binaries, enabling efficient similarity assessment.

Files

windows-dataset.zip

Files (6.3 GB)

Name Size Download all
md5:3233d5ba8793af1778bff940e32c1740
6.3 GB Preview Download

Additional details

Funding

Ministerio de Ciencia, Innovación y Universidades
MIMFA TED2021-131115A-I00
European Commission
Recovery, Transformation and Resilience Plan
European Commission
Proyecto Estratégico CIBERSEGURIDAD EINA UNIZAR (funded by Spanish National Cybersecurity Institute)
Gobierno de Aragón
Programa de Proyectos Estratégicos de Grupos de Investigación T21-23R

Software

Programming language
Python