Published July 23, 2021 | Version v1
Dataset Open

Simulated pp collisions at 13 TeV with 2 leptons + 1 b jet final state and selected benchmark Beyond the Standard Model signals

Description

This data-set is comprised of simulated events of pp collisions at 13 TeV with 2 leptons + 1 bottom jet sinal state, with HT > 500 GeV. It includes the following samples

  • Standard-Model background (bkg), generated at leading order includes the sub-samples Z+Jets, ttbar, WW, WZ, and ZZ.
    • The processes were generated in kinematic regions to ensure good statistics across the whole phase space. The sampling was carried out using event generation filters at parton level as follows
      • ttbar: pT <100 GeV; pT in [100, 250] GeV; pT > 250 GeV
      • The scalar sum of the pT of outgoing particles for Z+Jet: ST < 250 Gev; ST in [250, 500] GeV; ST > 500 GeV
      • W/Z pT for dibosons: pT < 250 GeV; pT in [250, 500] GeV; pT > 500 GeV
  • Vector-like T-quarks with masses 1.0, 1.2, 1.4 TeV (hq1000, hq1200, hq14000) pair produced either through the Standard-Model gluon (wohg) or through a BSM 3TeV heavy gluon (hg3000)
  • tZ production through a Flavour Changing Neutral Current (fcnc) vertex

The samples are provided with both a full set of features, or with a sanitised set of features. The sanitised features remove some accumulation at zeros from non-reconstructed objects (i.e. missing values). All samples were generated using MadGraph5 2.6.5 and the detector was simulated using Delphes 3 with the default CMS card. For the Standard-Model background, both Pythia 8.2 (with CMS CUETP8M1 underlying event tune and NNPDF 2.3 parton distribution functions) (pythia) and Herwig 7 (herwig) hadronisations are provided to compare the background simulation. For the BSM signals only Pythia is provided.

For the details of the generation and on the differences between the two feature sets please refer to Finding new physics without learning about it: anomaly detection as a tool for searches at colliders for more details. Each file provides a train:validation:split with the ratios 1:1:1 to ensure equal statistical description of the events at each step of the machine learning workflow.

Files

Files (1.2 GB)

Name Size Download all
md5:57a3fec9bd02144c50c14f248a6aad6f
277.3 MB Download
md5:9348281798352faf0d4c5f0e8f10890b
193.0 MB Download
md5:1c1e2661b21d866df1f726459da6b046
245.7 MB Download
md5:e6951a37f67dc07ea97053fea314988c
278.5 MB Download
md5:4e37f901b7e4d0e4b64f2db86fd8b6f2
30.3 MB Download
md5:5df8f68cb80b02a8e59cb783354f13d4
20.9 MB Download
md5:d31d78246c90e610d98c38a46e753d21
11.4 MB Download
md5:126e39058b7940a55f4b146c1418d30c
8.6 MB Download
md5:fce499411dea59ff1a667dd356fdcd83
10.9 MB Download
md5:db69d606620a98d1607e14e62b04efbe
8.1 MB Download
md5:eae7c83845b447d4057e85b452d70d98
10.5 MB Download
md5:712ec8d85c9ec1ed3c91b4f9c40ba92d
7.9 MB Download
md5:b4b65ac8fb8b13619978e313e4318d0c
11.9 MB Download
md5:84d3bf71663982e946072f4fc14ba787
9.0 MB Download
md5:9f3b2985a25f005028c903266fa80a58
11.1 MB Download
md5:210697b2cdd73c253dc053815f0e08e4
8.4 MB Download
md5:df3fd3e7b5f64508d25654e6d80cf11e
10.6 MB Download
md5:2e9a27924179fd5911648996962f7b55
7.9 MB Download