Published November 24, 2023 | Version v3
Dataset Open

PM100: A Job Power Consumption Dataset of a Large-Scale HPC System

Description

The dataset is a collection of jobs extracted from the job_table data of M100 (https://doi.org/10.5281/zenodo.7588815), a collection of data extracted from a Tier-0 supercomputer hosted at CINECA (Marconi100, https://www.hpc.cineca.it/hardware/marconi100).  The original job data present in M100 are filtered out by considering only the jobs running exclusively on the resources. Each job entry included in PM100 contains the power consumption of the job recorded at Node level, CPU level and Memory level. The final dataset contains 231116 jobs, executed on Marconi100 between May and October 2020. 

The dataset is stored as a parquet file, where each entry contains the information on a job execution. 

The structure of the data, as well as the code to generate them, is contained in the official GitHub repository of the project: https://github.com/francescoantici/PM100-data/.

Files

Files (287.2 MB)

Name Size Download all
md5:cf85f171a5ed5ff1d951d93f9bcbc6ef
287.2 MB Download

Additional details

Funding

European Commission
REGALE - An open architecture to equip next generation HPC applications with exascale capabilities 956560