Published August 31, 2021 | Version V2
Dataset Open

CMS Open Data 2012 datasets for dimuon exercises

  • 1. ETH Zurich (CH)

Description

These datasets are a subset of the CMS Open Data with 2012 data-taking conditions, intended for educational purposes.

In this version, the data and simulation files are compressed into one large file for easy access. They are stored in two formats (CSV and PKL) with identical content, so you only need to use one of them.

Once unzipped:

- Data files, whose names start with output_data_CMS_Run2012B, correspond to 4429.37 /pb of data collected by the CMS Experiment. They are a subset of the dataset in reference [1].

- Simulation files, whose names start with output_sim_CMS_MonteCarlo2012, are a subset of the dataset in reference [2]. The number of generated events in this case is 30458871, and the cross section is 3503.71.
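The three numbers quoted above are the usual inputs for scaling simulated events to the recorded data. The following is a minimal sketch of that arithmetic, not a prescription from the dataset authors. It assumes the cross section is given in pb (the description does not state the unit) and that the quoted number of generated events is the right normalisation for the files you actually load. The function name simulation_weight is hypothetical.

```python
# Values quoted in the dataset description.
LUMINOSITY_PER_PB = 4429.37   # integrated luminosity of the data files, in /pb
N_GENERATED = 30458871        # number of generated simulation events
CROSS_SECTION_PB = 3503.71    # ASSUMPTION: unit is pb; the description gives no unit


def simulation_weight(luminosity, cross_section, n_generated):
    """Per-event weight that scales simulation to the data luminosity.

    weight = luminosity * cross_section / n_generated
    """
    if n_generated <= 0:
        raise ValueError("n_generated must be positive")
    return luminosity * cross_section / n_generated


# Weight to apply to every simulated event when filling histograms.
weight = simulation_weight(LUMINOSITY_PER_PB, CROSS_SECTION_PB, N_GENERATED)

# Number of events the process is expected to yield at this luminosity.
expected_events = LUMINOSITY_PER_PB * CROSS_SECTION_PB

print(f"per-event weight: {weight:.4f}")
print(f"expected events:  {expected_events:.0f}")
```

Under these assumptions the weight comes out at roughly 0.51, meaning each simulated event stands for about half an event in data.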

All the files were processed with a modified version of the AOD2NanoAODOutreachTool [3]. The small modifications are related to the number of triggers stored, and some objects like taus were removed.

 

--------------------------------------------------------

[1] CMS collaboration (2017). DoubleMuParked primary dataset in AOD format from Run of 2012 (/DoubleMuParked/Run2012B-22Jan2013-v1/AOD). CERN Open Data Portal. DOI:10.7483/OPENDATA.CMS.YLIC.86ZZ

[2] Wunsch, Stefan; (2019). DYJetsToLL dataset in reduced NanoAOD format for education and outreach. CERN Open Data Portal. DOI:10.7483/OPENDATA.CMS.SRRA.2GON

[3] https://github.com/cms-opendata-analyses/AOD2NanoAODOutreachTool

Notes

For the CSV files, you might need to open them with pandas as: pandas.read_csv('output_data.csv', index_col=['entry','subentry']). For the pickle files, you might need to use Python 3.
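The index_col argument above produces a two-level index, which affects how rows are selected. The sketch below illustrates this on a tiny in-memory stand-in for a CSV file. The column names Muon_pt and Muon_eta and all values are hypothetical, and reading 'entry' as the event number and 'subentry' as the object number within the event is an assumption, not something the description states.

```python
import io

import pandas as pd

# Tiny stand-in for one of the CSV files (hypothetical columns and values).
sample_csv = io.StringIO(
    "entry,subentry,Muon_pt,Muon_eta\n"
    "0,0,45.2,0.31\n"
    "0,1,38.7,-1.12\n"
    "1,0,27.9,2.05\n"
)

# Same call as in the note above, applied to the in-memory sample.
df = pd.read_csv(sample_csv, index_col=["entry", "subentry"])

# Number of rows per entry (assumed: objects per event).
rows_per_entry = df.groupby(level="entry").size()

# All rows belonging to entry 0.
entry0 = df.loc[0]

# First row (subentry 0) of every entry.
first_rows = df.xs(0, level="subentry")

print(rows_per_entry)
print(entry0)
print(first_rows)
```

If the PKL files hold pickled pandas objects (an assumption; the description does not say), pandas.read_pickle would be the analogous call and should return the same structure without needing index_col.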

Files (6.7 GB)

md5:3295e80674813e59d059a68d537766c7 (3.2 GB)
md5:c70b0f0404f7bda89317ce35f84974ae (2.1 GB)
md5:83ad8d857d3d803630955ec52fb916a9 (868.2 MB)
md5:45a7a9bf45bb9ca28974ea28e61ec580 (568.5 MB)
