Published May 9, 2024 | Version v1
Dataset Open

Vector-QM24 (VQM24) dataset

  • 1. ROR icon University of Toronto
  • 2. ROR icon Vector Institute

Description

DFT properties for all 784,875 conformers in local minima; 258,242 constitutional isomers (most stable conformer) and 51,072 saddle point structures are available in the DFT_all.npz, DFT_uniques.npz and DFT_saddles.npz files respectively.
DMC data for 10,793 constitutional isomers is available in the DMC.npz file.

All molecules are ordered in the same way across every array.

Keys for accessing each property are tabulated in the paper.

Usage example :

import numpy as np

data = np.load('DFT_all.npz', allow_pickle=True)
print(data.files) #see a list of all properties

key = 'freqs'

property = data[key] #DFT vibrational frequencies of all molecules
print(property[42]) #Frequencies of molecule number 42 in the array (HSCl, Thiohypochlorous acid)

 

Input file samples and tools : https://github.com/dkhan42/VQM24

 

Wavefunction files will be uploaded elsewhere (due to file size restrictions) and linked here.

Atomic energies (in Hartree) used to calculate the atomization energies :

#atomic energies wB97X-D3/cc-pVDZ
eatomic = {'Hydrogen' : -0.5012728848846926,
'Carbon' : -37.83859584856468,
'Nitrogen' : -54.5760607136932450,
'Oxygen' : -75.0474818911551438,
'Fluorine' : -99.7031524437270917,
'Bromine' : -2574.01253635198464,
'Chlorine' : -460.13960793480203,
'Phosphorous' : -341.2510291850040858,
'Sulfur' : -398.1021030909759020,
'Silicon' : -289.3578409507016431}

Files

Files (1.5 GB)

Name Size Download all
md5:f804fc13ead16c27a10f5d8ff2816511
1.1 GB Download
md5:82bfaf515f720d45cc5fe03e401b73f4
111.8 MB Download
md5:f2e32ff43232445e063795d5c6abbaed
337.6 MB Download
md5:565e295d845662d7df8e0dcca6db0d21
2.9 MB Download