Published July 4, 2023 | Version 2
Journal article Open

Dileptonic ttbar Neutrino Regression Dataset

  • 1. University of Geneva
  • 2. Harvard University

Description

Two sets of MC simulated events for the training/evaluation of neutrino regression models. The two datasets use different matrix element generators (MadGraph and Pythia).

  • The nominal (Magraph) dataset contains 940605 events split across 4 files for training and one file containing 75496 events for final evaluation
  • The alternative (Pythia) dataset contains 1248315 events split across 9 files for training and one file containing 138128 events for final evaluation.

Simulation Details

  • All events are generated from simulated proton-proton collisions at a center-of-mass energy of 13 TeV.
  • The mass of the top quark is set to 173 GeV.
  • The nominal dataset has MadGraph (v3.1.0) for ME calculation and Pythia8 (v8.243) for the shower, while the alternative dataset uses Pythia8 (v8.307) for both ME and shower.
  • Both samples are interfaced to Delphes (v3.4.2) for detector simulation with a parametrization that mimics the response of the ATLAS detector.

 

Event Reconstruction and Selection

  • Jets are reconstructed using energy-flow objects and the anti-kt algorithm with R=0.4.
  • Jet b-tagging corresponding to an inclusive signal efficiency of 70%.
  • At least two jets with pT > 25 GeV in the range |eta|<2.5 are required.
  • Exactly two reconstructed electrons or muons with pT > 15 GeV in the range |eta|<2.5 is required.

 

File Contents

Each HDF file contains the “delphes” table which holds multiple arrays and structured arrays.

The reconstructed information includes:

  • MET: The missing transverse momentum of the event (stored using polar cords)
    • Keys: MET, phi
  • leptons: The single reconstructed lepton in the event:
    • Keys: pt, eta, phi, energy, charge, type
  • jets: A zero padded table of the leading 10 jets in the event
    • Keys: pt, eta, phi, energy, is_tagged
  • njets: Numpy array holding the number of reconstructed jets in the event
  • nbjets: Numpy array holding the number of b-tagged jets

The truth/generator level information includes:

  • neutrinos: Truth level information of the single neutrino in the event
    • Keys: PDGID, pt, eta, phi, mass
  • truth_quarks: Table of information for the two b-quarks in the event
    • Keys: PDGID, pt, eta, phi, mass
  • truth_particles: Table of information of all truth level particles in the event
    • Keys: PDGID, pt, eta, phi, mass
  • jets_indices: A numpy array containing the identity of each of the jets in the reconstructed jet array (above)
    • 0 = b-jet, 1 = anti b-jet, -1 = other

Files

Files (1.6 GB)

Name Size Download all
md5:9d2843ba334508c29ca0aae13cd3a01c
107.1 MB Download
md5:0a558140c0ec6c82f7465f8823f3269e
107.2 MB Download
md5:ebd54b215af4588de7bfcb395023da70
107.4 MB Download
md5:82f92efc12f89f4cb74ec07c19278554
107.4 MB Download
md5:5001ffec72d733558ca26cefdd9a60ce
107.4 MB Download
md5:70778f1a43cb20b7b268f2055cf261a1
107.3 MB Download
md5:20cb3b9c727eaaf725e84c9ff4123821
107.6 MB Download
md5:5ff9ab0dee2eef110ee67e604edf0712
107.4 MB Download
md5:fedee3ac9f0744d07d9234386a68d371
107.2 MB Download
md5:b1f167df15e150db6518b44800d839ee
108.3 MB Download
md5:95ab2eb5446ffda021350eabb840f726
42.2 MB Download
md5:00accba8b670390ab438ec1a31278123
130.9 MB Download
md5:a0736df44d1ed1ff24101d25a4a072d9
131.3 MB Download
md5:5481d31f68fe664bcf25c0cd1e9ef2ba
130.7 MB Download
md5:91597176072261a07917a403298a1813
132.4 MB Download