Published October 6, 2023 | Version v1.5.0
Dataset Open

Simulated datasets for detector and particle flow reconstruction: CLIC detector, hit-based data, machine learning format

Description

Derived from https://zenodo.org/record/8260741, prepared in a machine-learning friendly TFDS format, ready to be used with https://zenodo.org/record/8397954.

  • clic_edm_ttbar_hits_pf10k.tar: ee -> ttbar, center of mass energy at 380 GeV, 10k events
  • clic_edm_qq_hits_pf10k.tar: ee -> Z* -> qqbar, center of mass energy at 380 GeV, 10k events

Contents

Each .tar file contains the dataset in the tensorflow-datasets (minimum version v4.9.1), array_record format.

Dataset semantics

Each dataset consists of events that can be iterated over using the tensorflow-datasets library in either tensorflow or pytorch. Each event has the following information available:

  • X: the reconstruction input features, i.e. tracks and calorimeter hits
  • ygen: the ground truth particles with the features ["PDG", "charge", "pt", "eta", "sin_phi", "cos_phi", "energy", "jet_idx"], with "jet_idx" corresponding to the gen-jet assignment of this particle
  • ycand: the baseline Pandora PF particles with the features ["PDG", "charge", "pt", "eta", "sin_phi", "cos_phi", "energy", "jet_idx"], with "jet_idx" corresponding to the gen-jet assignment of this particle

The full semantics, including the list of features for X, are available at https://github.com/jpata/particleflow/blob/v1.6/mlpf/heptfds/clic_pf_edm4hep_hits/utils_edm.py.

Notes

Funding support for the development and generation of this dataset by Estonian Research Council (ETAG) grant PSG864. The full dataset is hosted at the Julich HPC, supported by the CoE RAISE project. CoE RAISE project has received funding from the European Union's Horizon 2020 – Research and Innovation Framework Programme H2020-INFRAEDI-2019-1 under grant agreement no. 951733.

Files

Files (4.8 GB)

Name Size Download all
md5:26c7748ecb845fd63ae9551753496bd0
1.9 GB Download
md5:1b80276439f9b119cfbb7c3676457fe2
2.9 GB Download

Additional details

Related works

Cites
Dataset: 10.5281/zenodo.8260741 (DOI)