Published March 15, 2024 | Version 1.0.0
Dataset Open

heat of hydrogenation for diverse organic compounds -- experimental and calculated data for 166 unique reactions

Creators

Description

General remarks

The experimental data was drawn from reactions involving H2 that are available at NIST (accessed on 15/03/2024). Only reactions of type M + H2 => MH2, where M is a neutral, closed-shell organic molecule that accepts one equivalent of H2, were included in the collection. M corresponds to the oxidized form of the molecule ( => suffix '_ox'), MH2 to the reduced form (=> suffix '_red'). For reasons of clarity, the references to original publications were abbreviated in the main table (look up in separate table).

For the molecules involved, Smiles were manually assigned. From those, 3D structures were generated and evaluated in order to match the thermodynamic properties as accurately as possible (for details on the procedure refer to the related work, see below).

In addition to the experimental uncertainty, a significant scatter is seen for replicate measurements.

Please note: To compute the heat of hydrogenation from the calculated data for M/MH2 the contribution of H2 needs to be considered, take e.g. -1.164816 hartree (Energy at 298.15K, calculated at CCSD(T)=FULL/aug-cc-pVDZ) from CCCBDB (accessed on 15/03/2024).

 

Description of files

The file 01_heat_of_hydrogenation_XP+QM.csv contains experimentally measured and calculated data.

  • columns are separated by "|"
  • column names and explanations:
    • NIST_idx -- index of original reaction, mostly unique. In a few cases, data of the reverse reaction were subsumed under a different index
    • env -- if available, information about the environment a reported reaction took place in, e.g. gas phase, hexane, etc...
    • method -- if available, reference about the experimental technique, e.g. 'Eqk' = Heat of equilibrium, 'Cm' = Calorimetry, 'Chyd' = Calorimetry of hydrogenation
    • Temperature K -- if available, reported values 
    • reference -- Abbreviation of reference to original publication
    • experimental heat of reaction kJ/mol -- measured value as reported by experimentalists
    • experimental uncertainty -- if available, uncertainty of measurement reported by experimentalists
    • comments -- notes relating to identification of compounds
    • SMILES_ox -- isomeric canonical SMILES for oxidized form M
    • InChI_ox -- InChI for oxidized form M 
    • SMILES_red -- isomeric canonical SMILES for reduced form M
    • InChI_red -- InChI for reduced form M
    • reaction_index -- consequtively numbered for identical pairs (SMILES_ox, SMILES_red)
    • the calculated properties are given for the oxidized and reduced form of the molecule (in hartree)
      • E(B3LYP/6-31G(2df,p))
      • E_thermal
      • E(G4(MP2))@0K
      • E(G4(MP2))@298K
      • H(G4(MP2))
      • heat_of_formation@0K
      • heat_of_formation@298K

02_molecules.sdf: provides for each molecule a low-energy geometry along with some descriptors and calculated energetic properties:

  • coordinate block + bond information
  • properties
    • SMILES -- isomeric canonical smiles linking compound to reactions defined in 01_heat_of_hydrogenation_XP+QM.csv
    • radical_electrons -- number of unpaired electrons as determined by RDKit
    • empirical_formula -- elemental composition of molecule
    • molecular_weight -- as determined by RDKit in g/mol
    • TPSA --  topological polar surface area (TPSA) as determined by RDKit
    • logP -- octanol/water partition coefficient as predicted by RDKit
    • nof_heavy_atoms -- number of non-hydrogen atoms in molecule
    • degree_of_unsaturation -- sum of multiplebonds and/or rings present in the compound
    • rings -- number of rings in the compound as determined by RDKit
    • multiplicity -- spin multiplicity for use as input for QM calculations
    • nof_multiple_bonds -- number of multiple bonds as determined by RDKit
    • Std_InChI -- standard InChi
    • FixedH_InChI -- variant of InChI to differentiate tautomers
    • tag -- dataset label
    • total_atoms -- total number of atoms (including H)
    • net_charge -- total charge of molecule in units of elementary charge
    • energetic properties (in hartree) 

      • E(B3LYP/6-31G(2df,p))

      • E(HF/maug-cc-p(T+d)Z)

      • E(HF/CBS)

      • E(HF/maug-cc-p(Q+d)Z)

      • E(MP2/6-31G(d))

      • E(CCSD(T)/6-31G(d))

      • E(HF/G3MP2LARGEXP)

      • E(MP2/G3MP2LARGEXP)

      • DE(MP2) hartreeDE(HF)

      • ZPE(B3LYP) hartree

      • ZPE_scale_factor hartree

      • E(HLC) hartree

      • E_thermal hartree

      • H_thermal hartree

      • E(G4(MP2))@0K hartree

      • E(G4(MP2))@298K hartree

      • H(G4(MP2)) hartree

      • heat_of_formation@0K kcal/mol

      • heat_of_formation@298K kcal/mol

03_references.csv (separated by "|") lists abbreviations and corresponding full reference to original publication of individual data points.

Files

01_heat_of_hydrogenation_XP+QM.csv

Files (1.1 MB)

Name Size Download all
md5:42bd7e493effe9f0412935d8e71fb4ff
128.9 kB Preview Download
md5:f9681a6881d7558eb3d731a8b5c1d18b
993.0 kB Download
md5:8b182358b065590b58f9e060dff93d9b
12.0 kB Preview Download

Additional details

Related works

Is supplement to
Journal article: 10.1002/batt.202100059 (DOI)

Funding

Modelling for the search for new active materials for redox flow batteries 875489
European Commission