Published August 5, 2021 | Version v1
Dataset Open

Preprocessed Dataset for ``Calorimetric Measurement of Multi-TeV Muons via Deep Regression"

  • 1. CERN
  • 2. Padua U. and INFN, Padua
  • 3. Padua and INFN, Padua
  • 4. INFN, Padua
  • 5. INFN, Padua and Naples U. and INFN, Naples

Description

This record contains the fully-preprocessed training/validation and testing datasets used to train and evaluate the final models for "Calorimetric Measurement of Multi-TeV Muons via Deep Regression" by Jan Kieseler, Giles C. Strong, Filippo Chiandotto, Tommaso Dorigo, & Lukas Layer, (2021), arXiv:2107.02119 [physics.ins-det] (https://arxiv.org/abs/2107.02119).

The files are LZF-compressed HDF5 format and designed to be used directly with the code-base available at https://github.com/GilesStrong/calo_muon_regression. Please use the 'issues' tab on the GitHub repo for any questions or problems with these datasets.

The training dataset consists of 886,716 muons with energies in the continuous range [50,8000] GeV split into 36 subsamples (folds). The zeroth fold of this dataset is used as our validation data. The testing dataset contains 429,750 muons, generated at fixed values of muon energy (E=100, 500, 900, 1300, 1700, 2100, 2500, 2900, 3300, 3700, 4100 GeV), and split into 18 folds. The input features are the raw hits in the calorimeter (stored in a sparse COO representation), and the high-level features discussed in the paper.

Files

Files (47.2 GB)

Name Size Download all
md5:621cc50ba6fd69ac15beb77bb8a473d6
12.8 GB Download
md5:3dcb3cc63e19da381b65749921bbee0f
34.3 GB Download
md5:32d1bb3cdc01bf7034e3fc803fea49e4
84.1 MB Download

Additional details

Related works