Kasieczka, Gregor
Plehn, Tilman
Thompson, Jennifer
Russel, Michael
2019-03-22
<p>A set of MC simulated training/testing events for the evaluation of top quark tagging architectures.</p>
<p>In total 1.2M training events, 400k validation events and 400k test events. Use “train” for training, “val” for validation during the training and “test” for final testing and reporting results.</p>
<p><strong>Description</strong></p>
<ul>
<li>
<p>14 TeV, hadronic tops for signal, qcd diets background, Delphes ATLAS detector card with Pythia8</p>
</li>
<li>
<p>No MPI/pile-up included</p>
</li>
<li>
<p>Clustering of particle-flow entries (produced by Delphes E-flow) into anti-kT 0.8 jets in the pT range [550,650] GeV</p>
</li>
<li>
<p>All top jets are matched to a parton-level top within ∆R = 0.8, and to all top decay partons within 0.8</p>
</li>
<li>
<p>Jets are required to have |eta| < 2</p>
</li>
<li>
<p>The leading 200 jet constituent four-momenta are stored, with zero-padding for jets with fewer than 200</p>
</li>
<li>
<p>Constituents are sorted by pT, with the highest pT one first</p>
</li>
<li>
<p>The truth top four-momentum is stored as truth_px etc.</p>
</li>
<li>
<p>A flag (1 for top, 0 for QCD) is kept for each jet. It is called is_signal_new</p>
</li>
<li>
<p>The variable "ttv" (= test/train/validation) is kept for each jet. It indicates to which dataset the jet belongs. It is redundant as the different sets are already distributed as different files.</p>
</li>
</ul>
https://doi.org/10.5281/zenodo.2603256
oai:zenodo.org:2603256
Zenodo
https://doi.org/10.5281/zenodo.2603255
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
Top Quark Tagging Reference Dataset
info:eu-repo/semantics/other