Published April 1, 2026 | Version v1
Dataset Open

Pythia8 Quark and Gluon Jets (float16)

  • 1. EDMO icon Brown University

Description

A float16 (half-precision) version of the quark and gluon jet dataset originally published by Komiske, Metodiev, and Thaler (Zenodo record 3164691). Only the 20-file subset without charm and bottom quark jets is included here. All simulation parameters and jet selection criteria are identical to the original:

  • Pythia 8.226, √s = 14 TeV
  • Quarks from WeakBosonAndParton:qg2gmZq, gluons from WeakBosonAndParton:qqbar2gmZg with the Z decaying to neutrinos
  • FastJet 3.3.0, anti-k_t jets with R = 0.4
  • p_T^jet ∈ [500, 550] GeV, |y^jet| < 1.7

There are 20 files, each in compressed NumPy format (QG_jets_fp32_0.npz through QG_jets_fp32_19.npz). Each file contains two arrays:

  • X: (100000, M, 4) — 50k quark and 50k gluon jets, randomly sorted, padded to max multiplicity M, with particle features (pt, rapidity, azimuthal angle, pdgid) stored as float32
  • y: (100000,) — jet labels, gluon = 0, quark = 1

The original dataset stores X in float64. Here X has been cast to float16, approximately halving file size. The y labels are unchanged. Users should be aware that float16 has limited dynamic range and precision.

If you use this dataset, please cite the original Zenodo record and its associated paper:

  • Komiske, Metodiev, Thaler, Energy Flow Networks: Deep Sets for Particle Jets, JHEP 01 (2019) 121, arXiv:1810.05165

Files

Files (576.7 MB)

Name Size Download all
md5:bc3b179350016b58f2cdc30ea7840baf
28.9 MB Download
md5:1dfadd420ff590473f72588a97e22918
28.8 MB Download
md5:922f5f98802afc96915bc30ddcb44a25
28.8 MB Download
md5:0b69da47a7a121e10a6fcc0803c95ede
28.8 MB Download
md5:7f17ebcbaef35b6a3b8548e815e6f0c8
28.9 MB Download
md5:002e6226da7f346cff04b94f2c19e51b
28.9 MB Download
md5:f40c65ecb390acdca08d91e19eb77d8a
28.8 MB Download
md5:026a6f195df37ae86246435f9e08c862
28.8 MB Download
md5:bfcf80cce5c3b4e3ad4f40581abef8eb
28.9 MB Download
md5:d7fa9c019dffb237c58736605cb7a2ce
28.8 MB Download
md5:eac256634f53540d5bb60a5b40e74d83
28.8 MB Download
md5:2db1b79216a300a72febffc2b1ed4a26
28.9 MB Download
md5:edd457892a8c7c0df3ae2b690ecca8c6
28.8 MB Download
md5:cf6d364d6f963242e93edbb928f57ed1
28.8 MB Download
md5:db821f9b7d2fe530da867a75c3cb76ae
28.8 MB Download
md5:12aaae7acafdd98b60604156d93b469f
28.8 MB Download
md5:cef891f0fa9397802f9950a0f80e3109
28.9 MB Download
md5:66960aed0ea535a32181b12611f7ac07
28.8 MB Download
md5:3355e7d91047ad160be99fc894610db1
28.9 MB Download
md5:c8fe7002d0405220616d42d502234265
28.8 MB Download

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.2658763 (DOI)
Is variant form of
Dataset: 10.5281/zenodo.19362155 (DOI)
Dataset: 10.5281/zenodo.19362689 (DOI)

References

  • P. T. Komiske, E. M. Metodiev, EnergyFlow, https://energyflow.network
  • P. T. Komiske, E. M. Metodiev, J. Thaler, Energy Flow Networks: Deep Sets for Particle Jets, JHEP 01 (2019) 121, arXiv:1810.05165