Published March 25, 2025 | Version v1
Dataset Open

Simulated and Reconstructed Events with CMS and DELPHES

  • 1. Weizmann Institute of Science
  • 2. National Energy Research Scientific Computing Center

Description

We provide the dataset used for fast simulation studies.

The dataset consists of preprocessed CMS Open Data for the ttbar, gg->H->4l, QCD 470, and QCD 600 processes. Each file includes one branch with the stable input particles before detector simulation and one branch with the reconstructed particle-flow candidates. A pT cut of 1 GeV is applied to the particle-flow candidates, and a pT cut of 0.25 GeV is applied to the stable input particles. In addition, a |η| < 2.7 cut is applied to both collections.
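The selection described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `passes_cuts` and the toy particle values are hypothetical, and the description does not say whether the pT thresholds are strict or inclusive, so strict inequalities are assumed here.

```python
# Thresholds quoted in the description (GeV for pT, unitless for eta).
PFLOW_PT_MIN = 1.0    # particle-flow candidates
TRUTH_PT_MIN = 0.25   # stable input particles
ABS_ETA_MAX = 2.7     # both collections


def passes_cuts(pt, eta, pt_min, abs_eta_max=ABS_ETA_MAX):
    """Return True if a particle survives the pT and |eta| cuts.

    Assumption: both cuts are strict inequalities.
    """
    return pt > pt_min and abs(eta) < abs_eta_max


# Toy (pT, eta) pairs, invented purely for illustration.
particles = [(0.2, 0.1), (0.5, 1.0), (1.5, -2.9), (3.0, -1.0)]

truth_mask = [passes_cuts(pt, eta, TRUTH_PT_MIN) for pt, eta in particles]
pflow_mask = [passes_cuts(pt, eta, PFLOW_PT_MIN) for pt, eta in particles]
```

With these toy values, the second particle passes only the looser 0.25 GeV threshold, and the third fails the |η| requirement under either threshold.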

Our model was trained on the first 2,800,000 events from the `qcd470_ttbar_train_cms.root` file, with the remaining 35,000 events used for validation.
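The split above amounts to two contiguous, non-overlapping entry ranges. A minimal sketch, assuming zero-based event indices and that the two counts together cover the whole file (2,835,000 events); the helper name `split_ranges` is hypothetical:

```python
N_TRAIN = 2_800_000  # first events, used for training
N_VAL = 35_000       # remaining events, used for validation


def split_ranges(n_train=N_TRAIN, n_val=N_VAL):
    """Return half-open index ranges (train, validation) over the events."""
    train = range(0, n_train)
    val = range(n_train, n_train + n_val)
    return train, val


train_range, val_range = split_ranges()
total_events = len(train_range) + len(val_range)
```

Half-open ranges of this kind map directly onto start/stop entry arguments in common ROOT readers, so the same numbers can be reused when loading only one part of the file.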

Each test file contains 100,000 events.

For comparison, we provide detector simulations obtained with DELPHES 3.5 for the same set of truth-level events. We used the CMS card with pileup, with an average of 6.35 pileup vertices per event. To match the full simulation, we disabled the smearing of the truth primary vertex in DELPHES, while the pileup vertices were smeared using the default values.

Files

Files (48.1 GB)

MD5 checksum                          Size
md5:1ac278939614d723ab7ae4594e0932df  666.2 MB
md5:27ef74adea173f479150902521b4e144  1.4 GB
md5:28784140ad36a0fa52e6deb52f647605  1.3 GB
md5:f12584abd8bc2ba52b4e41937fa9af29  2.3 GB
md5:0ac05fd29b5a806df2e01ccd4402b3f2  35.4 GB
md5:d3784927e453c18773494281612a5b5a  1.3 GB
md5:f4dea1acf0b5dedca9ebff0e17f83c74  2.3 GB
md5:acc16fd7a44304e1363e05e484c8516f  1.2 GB
md5:d0cacfff6b143d9e74b0b453a9671d64  2.2 GB
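
The MD5 values listed above can be used to check a download for corruption. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the demonstration runs on a small temporary file rather than on one of the dataset files.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed value such as 'md5:1ac2...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Demonstration on a throwaway file (not part of the dataset).
payload = b"example payload"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
demo_path = tmp.name
demo_ok = matches_listing(demo_path, "md5:" + hashlib.md5(payload).hexdigest())
os.remove(demo_path)
```

Reading in chunks keeps memory use flat, which matters for the largest file in the listing (35.4 GB).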

Additional details

Dates

Available
2025