Published May 2, 2025 | Version v1
Dataset Open

Training Data for Gollum in Higgs Uncertainty Challenge

Description

This is a subset of the training data for "Gollum", Team HEPHY's submission to the FAIR Universe Higgs Uncertainty Challenge. For the full training, we used systematic variations up to 3 standard deviations. All of these can be generated from the nominal file (provided by the challenge) and the code that the challenge provided. 

Data was taken from CodeBench: https://www.codabench.org/datasets/download/b9e59d0a-4db3-4da4-b1f8-3f609d1835b2/, and systematic variations were applied according to the description https://arxiv.org/abs/2410.02867

Gollum Code: https://github.com/HephyAnalysisSW/GOLLUM

Gollum Publication: TBA

FAIR Univerese Challenge: https://www.codabench.org/competitions/2977, https://github.com/FAIR-Universe/HEP-Challenge/tree/master/

The files are:

  • nominal.h5 is a dataset without systematic variations
  • met, jes, tes in the file name indicates a variation of MET (missing energy), jet energy scale, or tau energy scale to the value that follows. For example: the file tes_0p99_jes_0p99.h5 includes events where the tau and jet energy scales were both multiplied with a factor of 0.99, corresponding to nuicances parameters with the value of -1.
  • Normalization-type uncertainties are not included, as these samples can be obtained by changing the corresponding event weights. 

The file toy_mu_2.h5 is a toy dataset with true value of the signal strength mu set to 2. The detailed values of the nuisance parameters (in the format (1+alpha), see the documentation of the Challenge) are stored in a dictionary that can be found in toy_mu_2.pkl


import pickle

myfile = open('toy_mu_2.pkl', 'rb')
nuisances = pickle.load(myfile)
print(nuisances)

>>> {'mu': 2.0, 'tes': 0.9945927027846702, 'jes': 1.0094985245559578, 'soft_met': 0.16120480304203255, 'ttbar_scale': 0.9866736788084463, 'diboson_scale': 1.0610956857813547, 'bkg_scale': 0.999100305445466}


Files

Files (45.3 GB)

Name Size Download all
md5:b39e8fdaae5a79376feca6c3374b4ddb
2.2 GB Download
md5:73625c9b1c7b9956fad3256d11a47641
2.2 GB Download
md5:60f12eb4517445756d5b0f45601b0926
2.2 GB Download
md5:effa43ea094c4ca57b2913002337c370
2.3 GB Download
md5:714b937926d3701bbce93ea3cb42ffdb
2.3 GB Download
md5:4924154fca2514dfd96e5685881c453c
2.3 GB Download
md5:1230841ffb5df88698b69a7015cea45c
2.3 GB Download
md5:40bc53682b966028d5f7d79b50c8e734
2.3 GB Download
md5:318c2690a37724ec805b19ea7fef9f13
4.5 GB Download
md5:24164f2bc83acc012624d03efe0fa2a6
2.2 GB Download
md5:af7fa8a4556c0f63f38521a3ceabd9fd
2.2 GB Download
md5:03199f1334ecc2dacf49ee58e08bcf85
2.2 GB Download
md5:e5ad23053f1f50497c7b7185cb3283bc
2.3 GB Download
md5:b0c8687d7cb3bc81d7929b402a320aae
2.2 GB Download
md5:067e27eb665549bf2c426e0164cb77bd
2.3 GB Download
md5:5d38c048d26f6fc8dd0659feee7595f3
2.2 GB Download
md5:e4f8eab1be733b96e484bf91bc5a2495
2.3 GB Download
md5:e3fed5b6a5d8337eb8e539756d91bcdb
2.3 GB Download
md5:8597002eafcca88f404835e71f53a6d9
2.3 GB Download
md5:174c358c72af960a1a78021e753203e2
225.8 MB Download
md5:c3e695891f43dddd660c0bddaf1daecb
294 Bytes Download

Additional details

Software

Repository URL
https://github.com/HephyAnalysisSW/GOLLUM
Programming language
Python