Published June 4, 2024 | Version v2
Dataset Open

Datasets of DFT adsorption energies of H and for O and OH on different pure metals and binary intermetallic compounds considering the application of elastic strains and lists of candidates for screening

Description

This resource contains two datasets and two lists of candidates for screening in JSON format. Also It contains ZIP folders with all Quantum Espresso Inputs and outputs from which the JSON datasets were obtained. All Quantum Espresso outputs will be later added to Catalysis Hub (https://www.catalysis-hub.org/). The file "QuantumEspresso_versions" is a text file contaning the information of the Quantum Espresso versions employed for obtaining the dataset.

The datasets contain the adsorption energies for surface slabs of a large number of binary intermetallic compounds with different compositions and lattices (for instance, A3B fcc, A3B hpc, AB bcc, etc.). Adsorption energies were computed for different adsorbates (H, O, and OH) on distinct adsorption sites (e.g., fcc AAB, fcc AAA, hcp AAA, hcp AAB, on-top A, and on-top B) and minimum energy surfaces. In addition, different elastic strains (biaxial tension, biaxial compression) were applied to assess their effect on adsorption energies. All calculations were carried out using DFT approximations as implemented in the Open-source software Quantum Espresso. Besides the adsorption energies, the datasets also contain relevant geometric and electronic descriptors (PSI, cell volume, weighted atomic radius, generalized coordination number, weighted electronegativity, weighted first ionization energy, outer electrons, and biaxial strain)  calculated to feed them as features in the training of ML models. The datasets with the tag "scaled" on its name have the descriptors scaled following a MinMax scaling and are given in xlsx format.

The lists for screening contain candidates not included in the dataset for which Random Forest predictions of the Eads were obtained. The lists contain the geometric and electronic descriptors of all screening candidates, as well as the predicted adsorption energy (Eads_RF).

A GitHub repository is linked to this dataset (https://github.com/vvassilevg/HighHydrogenML). The repository contains two Python scripts:

1) Script for creating a dataset from QuantumEspresso outputs, where all relevant descriptors are computed. It outputs a pickle and json files that can be later converted to any other desired format (like xlsx).

2) Script for training a Random Forest model for the prediction of adsorption energies (the datasets with the "scaled" tag must be used for the script to work correctly).

 

The dataset, ML model and screening have been accepted for publication in Catalysis Science & Technology DOI: DOI:10.1039/D4CY00491D. The accepted Manuscript and the Supplementary information are avilable within this repository.

 

If you use this dataset or any of the files within this repository, please cite the original publication (10.1039/D4CY00491D) in your work.

Files

AcceptedManuscript_CatalSciTechnol2024.pdf

Files (1.4 GB)

Name Size Download all
md5:dc528abe03b8910a55887540333c8cc7
4.6 MB Preview Download
md5:6591402e8c2dbc07d7c30bbd7360d6f7
1.2 GB Preview Download
md5:e43f178eacd47ea60628d9a5b25849e9
96.6 kB Preview Download
md5:5e8f34f8faa2c12e932eb2d62018e29c
92.8 kB Download
md5:c3d20f8399b9fd634f87316f92d1efdf
179.5 kB Preview Download
md5:9446b3e7e280da4c2e6a5ef77abbea09
118.8 kB Download
md5:2b3e5ab288523ee3c03eed155e1ad97d
157.9 MB Preview Download
md5:a346da3304802339d5a9c5bdb6c018f5
630 Bytes Download
md5:074196f9ae4cb4fc7f8ca2099b05b34f
287.0 kB Preview Download
md5:e71989ef5f8788c258a6147b3c012a41
579.0 kB Preview Download
md5:ff3aec3bc24f231218161f41a7cb3b92
848.7 kB Preview Download

Additional details

Funding

European Commission
HighHydrogenML - High-throughput Discovery of Catalysts for the Hydrogen Economy through Machine Learning 101105610

Software

Repository URL
https://github.com/vvassilevg/HighHydrogenML.git
Programming language
Python
Development Status
Active