Preprocessed Dataset for ``Calorimetric Measurement of Multi-TeV Muons via Deep Regression"
Authors/Creators
- 1. CERN
- 2. Padua U. and INFN, Padua
- 3. Padua and INFN, Padua
- 4. INFN, Padua
- 5. INFN, Padua and Naples U. and INFN, Naples
Description
This record contains the fully-preprocessed training/validation and testing datasets used to train and evaluate the final models for "Calorimetric Measurement of Multi-TeV Muons via Deep Regression" by Jan Kieseler, Giles C. Strong, Filippo Chiandotto, Tommaso Dorigo, & Lukas Layer, (2021), arXiv:2107.02119 [physics.ins-det] (https://arxiv.org/abs/2107.02119).
The files are LZF-compressed HDF5 format and designed to be used directly with the code-base available at https://github.com/GilesStrong/calo_muon_regression. Please use the 'issues' tab on the GitHub repo for any questions or problems with these datasets.
The training dataset consists of 886,716 muons with energies in the continuous range [50,8000] GeV split into 36 subsamples (folds). The zeroth fold of this dataset is used as our validation data. The testing dataset contains 429,750 muons, generated at fixed values of muon energy (E=100, 500, 900, 1300, 1700, 2100, 2500, 2900, 3300, 3700, 4100 GeV), and split into 18 folds. The input features are the raw hits in the calorimeter (stored in a sparse COO representation), and the high-level features discussed in the paper.
Files
Files
(47.2 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:621cc50ba6fd69ac15beb77bb8a473d6
|
12.8 GB | Download |
|
md5:3dcb3cc63e19da381b65749921bbee0f
|
34.3 GB | Download |
|
md5:32d1bb3cdc01bf7034e3fc803fea49e4
|
84.1 MB | Download |
Additional details
Related works
- Is documented by
- Preprint: https://arxiv.org/abs/2107.02119 (URL)
- Requires
- Software: https://github.com/GilesStrong/calo_muon_regression/tree/v0.0 (URL)
- Software: 10.5281/zenodo.5167015 (DOI)