Datasets of DFT adsorption energies of H and for O and OH on different pure metals and binary intermetallic compounds considering the application of elastic strains and lists of candidates for screening
Authors/Creators
Description
This resource contains two datasets and two lists of candidates for screening in JSON format. Also It contains ZIP folders with all Quantum Espresso Inputs and outputs from which the JSON datasets were obtained. All Quantum Espresso outputs will be later added to Catalysis Hub (https://www.catalysis-hub.org/). The file "QuantumEspresso_versions" is a text file contaning the information of the Quantum Espresso versions employed for obtaining the dataset.
The datasets contain the adsorption energies for surface slabs of a large number of binary intermetallic compounds with different compositions and lattices (for instance, A3B fcc, A3B hpc, AB bcc, etc.). Adsorption energies were computed for different adsorbates (H, O, and OH) on distinct adsorption sites (e.g., fcc AAB, fcc AAA, hcp AAA, hcp AAB, on-top A, and on-top B) and minimum energy surfaces. In addition, different elastic strains (biaxial tension, biaxial compression) were applied to assess their effect on adsorption energies. All calculations were carried out using DFT approximations as implemented in the Open-source software Quantum Espresso. Besides the adsorption energies, the datasets also contain relevant geometric and electronic descriptors (PSI, cell volume, weighted atomic radius, generalized coordination number, weighted electronegativity, weighted first ionization energy, outer electrons, and biaxial strain) calculated to feed them as features in the training of ML models. The datasets with the tag "scaled" on its name have the descriptors scaled following a MinMax scaling and are given in xlsx format.
The lists for screening contain candidates not included in the dataset for which Random Forest predictions of the Eads were obtained. The lists contain the geometric and electronic descriptors of all screening candidates, as well as the predicted adsorption energy (Eads_RF).
A GitHub repository is linked to this dataset (https://github.com/vvassilevg/HighHydrogenML). The repository contains two Python scripts:
1) Script for creating a dataset from QuantumEspresso outputs, where all relevant descriptors are computed. It outputs a pickle and json files that can be later converted to any other desired format (like xlsx).
2) Script for training a Random Forest model for the prediction of adsorption energies (the datasets with the "scaled" tag must be used for the script to work correctly).
The dataset, ML model and screening have been accepted for publication in Catalysis Science & Technology DOI: DOI:10.1039/D4CY00491D. The accepted Manuscript and the Supplementary information are avilable within this repository.
If you use this dataset or any of the files within this repository, please cite the original publication (10.1039/D4CY00491D) in your work.
Files
AcceptedManuscript_CatalSciTechnol2024.pdf
Files
(1.4 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:dc528abe03b8910a55887540333c8cc7
|
4.6 MB | Preview Download |
|
md5:6591402e8c2dbc07d7c30bbd7360d6f7
|
1.2 GB | Preview Download |
|
md5:e43f178eacd47ea60628d9a5b25849e9
|
96.6 kB | Preview Download |
|
md5:5e8f34f8faa2c12e932eb2d62018e29c
|
92.8 kB | Download |
|
md5:c3d20f8399b9fd634f87316f92d1efdf
|
179.5 kB | Preview Download |
|
md5:9446b3e7e280da4c2e6a5ef77abbea09
|
118.8 kB | Download |
|
md5:2b3e5ab288523ee3c03eed155e1ad97d
|
157.9 MB | Preview Download |
|
md5:a346da3304802339d5a9c5bdb6c018f5
|
630 Bytes | Download |
|
md5:074196f9ae4cb4fc7f8ca2099b05b34f
|
287.0 kB | Preview Download |
|
md5:e71989ef5f8788c258a6147b3c012a41
|
579.0 kB | Preview Download |
|
md5:ff3aec3bc24f231218161f41a7cb3b92
|
848.7 kB | Preview Download |
Additional details
Funding
Software
- Repository URL
- https://github.com/vvassilevg/HighHydrogenML.git
- Programming language
- Python
- Development Status
- Active