Published July 12, 2022 | Version 1
Dataset Open

Hunting for vampires and other unlikely forms of parity violation at the Large Hadron Collider: truth-jet and reco-jet datasets

  • 1. University of Cambridge
  • 2. UC, Berkeley

Description

All truth-jet and reco-jet datasets used in the paper.

Main: fragments are named `truth-jet_${MODEL}.tar.gz`, where MODEL is either `pv_msme_${lambdaPV}` for PV-mSME (lambdaPV is written as a floating-point number with "." replaced by "p") or `sm` for the Standard Model (lambdaPV = 0).
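
As a minimal sketch of the naming rule above, the following helper builds an archive name from a lambdaPV value. The function name `archive_name` is hypothetical, and the number of digits used in the published file names is an assumption (Python's default float formatting is used here), so compare the result against the actual file listing before relying on it.

```python
def archive_name(lambda_pv=None):
    """Return the archive name for one model.

    lambda_pv=None selects the Standard Model archive; any other value
    selects the PV-mSME archive for that lambdaPV.
    """
    if lambda_pv is None:
        model = "sm"
    else:
        # Assumption: the value is formatted with Python's default float
        # formatting, then "." is replaced with "p" as described above.
        model = "pv_msme_" + str(float(lambda_pv)).replace(".", "p")
    return f"truth-jet_{model}.tar.gz"


print(archive_name())     # truth-jet_sm.tar.gz
print(archive_name(0.5))  # truth-jet_pv_msme_0p5.tar.gz
```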

Within each are three directories for the independent splits:

  • train,
  • test, and
  • private_test.

In the paper, we describe "test" as the validation set and "private_test" as the test set. Data in "private_test" were not used until models were finalized for the paper.

Within each of those directories are the processed results from simulations with different random seeds. These are independent shards that can be trivially combined.
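
Because the shards are independent, combining them amounts to concatenating their arrays along the event axis. The sketch below illustrates this with stand-in arrays; `combine_shards` is a hypothetical helper, and it assumes each shard has already been read into a NumPy array with events along the first axis.

```python
import numpy as np


def combine_shards(shards):
    """Concatenate per-seed arrays along the event (first) axis."""
    return np.concatenate(list(shards), axis=0)


# Stand-in arrays; real shards would be read from the files of one split.
shard_a = np.zeros((3, 20))
shard_b = np.ones((2, 20))
combined = combine_shards([shard_a, shard_b])
print(combined.shape)  # (5, 20)
```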

Each data file is in HDF5 (h5) format. Its data are stored under the key "events" as an array with shape (n, 20). The last axis contains the reconstructed four-momenta of the five hardest jets in the order [Px, Py, Pz, E] * 5, with missing jets filled with zeros.
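
A minimal sketch of reading this layout is given below, assuming the `h5py` and NumPy packages. The helper names (`load_events`, `split_jets`, `jet_mask`) are hypothetical, and treating an all-zero four-momentum as a missing jet is an assumption based on the zero-filling described above.

```python
import numpy as np


def load_events(path):
    """Read the "events" array, shape (n, 20), from one data file."""
    import h5py  # third-party; imported here so the helpers below work without it

    with h5py.File(path, "r") as f:
        return f["events"][:]


def split_jets(events):
    """Reshape (n, 20) events to (n, 5, 4): five jets, each [Px, Py, Pz, E]."""
    return np.asarray(events).reshape(-1, 5, 4)


def jet_mask(jets):
    """True where a jet is present, False where the slot is zero padding."""
    return np.any(jets != 0, axis=-1)


# Stand-in event with two jets and three padded slots.
event = np.zeros((1, 20))
event[0, 0:4] = [30.0, 40.0, 0.0, 50.0]
event[0, 4:8] = [-30.0, -40.0, 0.0, 50.0]

jets = split_jets(event)
mask = jet_mask(jets)
pt = np.hypot(jets[..., 0], jets[..., 1])  # transverse momentum of each jet

print(int(mask.sum()))      # 2
print(pt[0, :2].tolist())   # [50.0, 50.0]
```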

Truth-jet files have additional keys "flavors" and "helicities". These contain truth-level flavour and helicity information, respectively. Their shape is (n, 5), where the last axis contains the zero-padded results. Flavours are encoded in the PDG ID scheme.
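
The sketch below shows one way to interpret a "flavors" array, assuming it has been loaded as an integer NumPy array of shape (n, 5). It relies on standard PDG IDs (1 to 6 for quarks, negative values for antiquarks, 21 for the gluon); `classify_flavors` is a hypothetical helper, and using a flavour of 0 to identify padded slots is an assumption.

```python
import numpy as np

GLUON = 21  # PDG ID of the gluon


def classify_flavors(flavors):
    """Return boolean masks (is_quark, is_gluon, is_padding) for PDG IDs."""
    flavors = np.asarray(flavors)
    is_padding = flavors == 0  # assumption: 0 marks a zero-padded slot
    is_gluon = flavors == GLUON
    is_quark = (np.abs(flavors) >= 1) & (np.abs(flavors) <= 6)
    return is_quark, is_gluon, is_padding


# Stand-in event: up quark, anti-down quark, gluon, two padded slots.
flavors = np.array([[2, -1, 21, 0, 0]])
is_quark, is_gluon, is_padding = classify_flavors(flavors)
print(int(is_quark.sum()), int(is_gluon.sum()), int(is_padding.sum()))  # 2 1 2
```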

Rotated: Rotated PV-mSME data are in archives named `truth-jet-rot_${HOUR}.tar.gz`.
These have a (train, test, private_test) structure similar to the others, but each archive holds one part of the same mixed model, and the parts should be combined by subsampling to a weighted average.

HOUR is an integer in [0, 23]; the detector is rotated by an angle of HOUR * 2 pi / 24 radians. This rotated dataset has lambdaPV = 1. We demonstrate merging of these datasets in the shared code (git, Zenodo).
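
For illustration only, the sketch below computes the rotation angle for a given HOUR and subsamples per-hour arrays into one mixture. It is not the merging procedure from the shared code: the helper names, the `weights` argument, and the example weights are all hypothetical, and the arrays are stand-ins for data loaded from the rotated archives.

```python
import math

import numpy as np


def rotation_angle(hour):
    """Detector rotation angle in radians for an integer HOUR in [0, 23]."""
    if not 0 <= hour <= 23:
        raise ValueError("HOUR must be in [0, 23]")
    return hour * 2 * math.pi / 24


def subsample_mixture(events_by_hour, weights, n_total, seed=0):
    """Draw about n_total events, taking a share weights[h] from each hour h.

    Events are drawn without replacement, so each hour must hold at least
    as many events as are requested from it.
    """
    rng = np.random.default_rng(seed)
    total_weight = sum(weights[hour] for hour in events_by_hour)
    parts = []
    for hour, events in events_by_hour.items():
        n_take = int(round(n_total * weights[hour] / total_weight))
        idx = rng.choice(len(events), size=n_take, replace=False)
        parts.append(events[idx])
    return np.concatenate(parts, axis=0)


# Stand-in data for two hours with hypothetical weights.
events_by_hour = {0: np.zeros((10, 20)), 6: np.ones((10, 20))}
weights = {0: 0.75, 6: 0.25}
mixture = subsample_mixture(events_by_hour, weights, n_total=8)
print(mixture.shape)  # (8, 20)
```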

Files

Files (40.7 GB)

Checksum (MD5) Size
md5:8cbca32f4093eb313d4d40818f05b24b 764.2 MB
md5:cfd71dda39841578e9b884f4e3b1b866 670.0 MB
md5:dc33d52ed7415591da3c5d25ec823562 623.4 MB
md5:16cac7b1858d35baff1df932668f9b07 670.5 MB
md5:0c0b5edee567f5cbc3e8d332b4406a5f 678.6 MB
md5:de9badaf44b965811f35096ed11e5899 659.6 MB
md5:005233814be3688e1f48c501741e2cd8 719.3 MB
md5:bb13d0bb67c3e4b82fbe6d102036a2d4 664.9 MB
md5:9120095ede0920cd715e9f09ce7364b8 717.8 MB
md5:f1e94574509e11043f36c6b655027158 765.1 MB
md5:6498f1dde73e2425e90db4b0b0176bd4 616.5 MB
md5:b9d4df4e7cf8e05ac7ccab3ed5b490c3 638.6 MB
md5:db12055f916f195b51105ceb2798d1a5 721.5 MB
md5:65fc997cc1cf99bb1255fe50f0a580c7 679.4 MB
md5:ccfca7462dcd30aba6bb907624f9120c 720.2 MB
md5:4b5f52cb2fef28bcb8b82216a0c1d3e0 730.0 MB
md5:6020f6efd3d39c56156a93e48818b36c 720.3 MB
md5:ab8298bda147d2735113bd5a4337cf7e 679.5 MB
md5:4ee39b4c6915abd57897ebad8a3c17e9 626.4 MB
md5:aeadb3d51f9b553e93123afc18681689 600.6 MB
md5:85ae1f72a3692e7f43e94f0aee732898 598.1 MB
md5:1ff390c6c5de620ff644669c9e49dcd2 596.6 MB
md5:a21b3e1b96223ba0404e3f67f21ae225 599.8 MB
md5:486c215669aebc8b2e19f75e1966e1fe 681.2 MB
md5:3d7bd770990764b5b156ca13bf447b6d 603.1 MB
md5:582e29d971bd67e792e883fa14e6b34f 626.8 MB
md5:bc963acc5176ca84802ea776bad4d863 681.3 MB
md5:a896400f9b3632da80c7495a684404c3 721.5 MB
md5:9a7b0dceb2b727c93d7b6a7b61c9914b 626.8 MB
md5:760e8a425e869bb95cbe093f223237ac 603.1 MB
md5:687788e124c5ec596578debf2eb06f0f 597.8 MB
md5:3e8fb3ff392d4b62727084162cfa1fdd 597.7 MB
md5:851d889cc99d54587ea721b008d2e4c7 600.6 MB
md5:9845e7febb28d36cece7611730e09951 603.6 MB
md5:4e973e7cb9b76507fd7c28b8f3c7c079 626.5 MB
md5:18daa39133269ad66d51d70209e8379f 730.3 MB
md5:66a588d003dd56feec5115f29e4dcfbc 583.9 MB
md5:8439f71b45cc6935a257e84435b13b07 584.0 MB
md5:e3f88f045d903b0632d6226e61a10577 584.1 MB
md5:bdb32ed6cdf01ad80fe1914490535fd8 584.3 MB
md5:4abf7e5a464f4ceb315ed8119a47aae3 584.5 MB
md5:f64dc01c852f14fc7fdf8ca5870eebfe 584.8 MB
md5:0ff36e524199b50280ff252684333482 585.2 MB
md5:604255d5d1ab9a1d54e81c2c9903ee59 585.6 MB
md5:96585e9ce624e03d230f53172a6eb6e7 586.0 MB
md5:c8fc1352e3e1f35d0e25a898e494fd65 586.7 MB
md5:12650e3a8340c0994b1aab6e7eb49e50 587.1 MB
md5:7f43f814fc6d2ceb4a55b1b3968d0310 587.6 MB
md5:fd81c371d62290562098600528d73891 588.5 MB
md5:d10d8e42e5cd6820887da3a9e78f4703 588.9 MB
md5:799eac6f53fe0a25ba15d98e403bb120 589.8 MB
md5:41d177436fadeb67482d2078230573d8 590.3 MB
md5:cad390e0f6137bcd949985c94f60dc7b 591.3 MB
md5:08d3ae561cf54bae8c5720b74807d1c0 592.0 MB
md5:8d64216a968321f63ebea06041052ac0 593.0 MB
md5:57fc660c24ec43a27f401e799b9a5a80 594.0 MB
md5:12c3eb6a3c7cd777b851f632e56e80eb 605.0 MB
md5:d49f97aae3cc9503c04ec7969db2de51 618.8 MB
md5:19ebaa65a28028e295a6c4adf690ff15 634.4 MB
md5:9813139f1c4c74cdb4f169fd497b0426 651.4 MB
md5:e076f69430de28e1ba118c6d3705fe91 669.8 MB
md5:78215c617afafae32927d3965c19d3f5 689.5 MB
md5:aa57b2c0a8615f2a7bf4c3858426e823 709.5 MB
md5:43078183f13106eaea170ff75064e122 583.9 MB
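
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5sum` and `verify` are hypothetical, and the demonstration uses a small temporary file rather than one of the archives.

```python
import hashlib
import os
import tempfile


def md5sum(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5sum(path) == expected_hex


# Demonstration on a temporary file containing the bytes b"hello".
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello")
demo_path = tmp.name

ok = verify(demo_path, "md5:5d41402abc4b2a76b9719d911017c592")
os.remove(demo_path)
print(ok)  # True
```

For a real download, pass the archive path and the matching `md5:` entry from the list above to `verify`.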

Additional details

Related works

Is continued by
Dataset: 10.5281/zenodo.6826628 (DOI)
Dataset: 10.5281/zenodo.6823457 (DOI)
Dataset: 10.5281/zenodo.6527112 (DOI)
Is referenced by
Software: 10.5281/zenodo.6827724 (DOI)
Is supplement to
Preprint: 10.48550/arXiv.2205.09876 (DOI)