Published March 31, 2023 | Version 1.0
Dataset Open

WilkinsonAFIRdb and related

Authors/Creators

  • 1. ICReDD, Hokkaido University

Description

Databases for all data related to the article: "Challenges for Kinetics Predictions via Neural Network Potentials: a Wilkinson’s catalyst case"

Each dataset was created with ASE db, and can be explored with:

import ase.db

with ase.db.connect(db_path) as db:
    for row in db.select():
        atoms = row.toatoms() # ASE Atoms object
        data = row.data # Diverse information (energy, gradients and dipole, at DFT, xTB [and NNP or NNP(+xTB)], geometry type, reaction path network connection, ...)

Data labels: 

  • data['energy']: DFT energy [eV]
  • data['gradients']: DFT gradients [eV/A]
  • data['dipole']: DFT dipole [Debye]
  • data['xTB']['GFN2-xTB']['energy']: xTB energy [eV] (when available)
  • data['xTB']['GFN2-xTB']['gradients']: xTB gradients [eV/A] (when available)
  • data['xTB']['GFN2-xTB']['dipole']: xTB dipole [Debye] (when available)
  • data['E_pred']: Prediction energy [eV] (NNP, NNP(+xTB), xTB, depending on the dataset), if available
  • data['grad_pred']: Prediction gradients [eV/A]
  • data['dipole_pred']: Prediction gradients [Debye]
  • data['geo_type']: Type of geometry ('EQ': Equilibrium state, 'TS': Transition state, 'NODE': intermediary geometry, 'TSEQ': barrier-less TS [both path top and path endpoint])
  • data['EQ_id']: GRRM EQ number (sort of exploration timestamp on EQs), when available
  • data['TS_id']: GRRM path number (exploration timestamp on paths), when available
  • data['node_id']: Position in path, when available

Datasets:

  • WilkinsonAFIRdb.db: DFT-powered AFIR-based search data (including the single geometry with failed xTB convergence)
  • pureNNP_20%_dataset.zip: train/val/test data from NNP model trained on the first 20% of DFT paths explored
  • pureNNP_50%_dataset.zip: train/val/test data from NNP model trained on the first 50% of DFT paths explored
  • pureNNP_80%_dataset.zip: train/val/test data from NNP model trained on the first 80% of DFT paths explored
  • pureNNP_20%_localSearch.db: local NNP-powered AFIR-based search data, using NNP model trained on the first 20% of DFT paths explored
  • pureNNP_50%_localSearch.db: local NNP-powered AFIR-based search data, using NNP model trained on the first 50% of DFT paths explored
  • pureNNP_80%_localSearch.db: local NNP-powered AFIR-based search data, using NNP model trained on the first 80% of DFT paths explored
  • NNPxTB_20%_localSearch: local NNP-powered AFIR-based search data, using NNP(+xTB) model trained on the first 20% of DFT paths explored
  • NNPxTB_50%_localSearch: local NNP-powered AFIR-based search data, using NNP(+xTB) model trained on the first 50% of DFT paths explored
  • NNPxTB_80%_localSearch: local NNP-powered AFIR-based search data, using NNP(+xTB) model trained on the first 80% of DFT paths explored
  • xTB_localSearch: xTB-powered AFIR-based search data
  • NNPxTB_20%_globalSearch: global/full NNP-powered AFIR-based search data, using NNP(+xTB) model trained on the first 20% of DFT paths explored (EQ and TS only)
  • NNPxTB_50%_globalSearch: global/full NNP-powered AFIR-based search data, using NNP(+xTB) model trained on the first 50% of DFT paths explored (EQ and TS only)

Note: DFT level of theory is RωB97X-D/Def2-SVP

Notes

Additional data related to the full NNP(+xTB)-powered searches will be added in a future version

Files

NNPxTB_20%_dataset.zip

Files (2.2 GB)

Name Size Download all
md5:666d550226113fbbc362bded99794c6a
330.6 MB Preview Download
md5:452f092ec15013b0c9dc7b2275ac8ac8
32.0 MB Download
md5:c570ce5d5530323bdf4bb88841dc8c05
872.4 kB Download
md5:7420c3b197d6aab9a97b31e1328e050b
330.5 MB Preview Download
md5:46b8ab434ee4ac544a8aa016e4726079
34.4 MB Download
md5:2d0367c35cfdabfbd4da7701c2682202
757.8 kB Download
md5:d89c35788e26cff8c751e663dd8c15c3
330.5 MB Preview Download
md5:04397f1525160bcf91c78949bc217ca1
770.0 kB Download
md5:a38bd7c497b0373b84573563b2a5719f
218.1 MB Preview Download
md5:ea3b8dd0d35a7d5e5fe58ccbc5ac4ed5
1.1 MB Download
md5:2dac702d0d2cec5636efec5e3c090d85
218.1 MB Preview Download
md5:ac1fa590ec1c58fe362d5d10e261c176
1.0 MB Download
md5:976e8702c37fe7bba6b7cbc5ea6fd494
218.2 MB Preview Download
md5:0384805c250e2bdc7e44461a580ec669
843.8 kB Download
md5:291137d44fdf4bb275ad7aab7d8fda93
516.8 MB Download
md5:8044c320bdd57ca9744de43eb639892a
720.9 kB Download