Datasets of DFT adsorption energies of H and for O and OH on different pure metals and binary intermetallic compounds considering the application of elastic strains and lists of candidates for screening

Martínez-Alonso, Carmen; Vassilev-Galindo, Valentin; Comer, Benjamin; Abild-Pedersen, Frank; Winther, Kirsten; LLorca, Javier

doi:10.5281/zenodo.11486422

Published June 4, 2024 | Version v2

Dataset Open

Datasets of DFT adsorption energies of H and for O and OH on different pure metals and binary intermetallic compounds considering the application of elastic strains and lists of candidates for screening

1. IMDEA Materials

This resource contains two datasets and two lists of candidates for screening in JSON format. Also It contains ZIP folders with all Quantum Espresso Inputs and outputs from which the JSON datasets were obtained. All Quantum Espresso outputs will be later added to Catalysis Hub (https://www.catalysis-hub.org/). The file "QuantumEspresso_versions" is a text file contaning the information of the Quantum Espresso versions employed for obtaining the dataset.

The datasets contain the adsorption energies for surface slabs of a large number of binary intermetallic compounds with different compositions and lattices (for instance, A3B fcc, A3B hpc, AB bcc, etc.). Adsorption energies were computed for different adsorbates (H, O, and OH) on distinct adsorption sites (e.g., fcc AAB, fcc AAA, hcp AAA, hcp AAB, on-top A, and on-top B) and minimum energy surfaces. In addition, different elastic strains (biaxial tension, biaxial compression) were applied to assess their effect on adsorption energies. All calculations were carried out using DFT approximations as implemented in the Open-source software Quantum Espresso. Besides the adsorption energies, the datasets also contain relevant geometric and electronic descriptors (PSI, cell volume, weighted atomic radius, generalized coordination number, weighted electronegativity, weighted first ionization energy, outer electrons, and biaxial strain) calculated to feed them as features in the training of ML models. The datasets with the tag "scaled" on its name have the descriptors scaled following a MinMax scaling and are given in xlsx format.

The lists for screening contain candidates not included in the dataset for which Random Forest predictions of the Eads were obtained. The lists contain the geometric and electronic descriptors of all screening candidates, as well as the predicted adsorption energy (Eads_RF).

A GitHub repository is linked to this dataset (https://github.com/vvassilevg/HighHydrogenML). The repository contains two Python scripts:

1) Script for creating a dataset from QuantumEspresso outputs, where all relevant descriptors are computed. It outputs a pickle and json files that can be later converted to any other desired format (like xlsx).

2) Script for training a Random Forest model for the prediction of adsorption energies (the datasets with the "scaled" tag must be used for the script to work correctly).

The dataset, ML model and screening have been accepted for publication in Catalysis Science & Technology DOI: DOI:10.1039/D4CY00491D. The accepted Manuscript and the Supplementary information are avilable within this repository.

If you use this dataset or any of the files within this repository, please cite the original publication (10.1039/D4CY00491D) in your work.

Files

AcceptedManuscript_CatalSciTechnol2024.pdf

Files (1.4 GB)

Name	Size	Download all
AcceptedManuscript_CatalSciTechnol2024.pdf md5:dc528abe03b8910a55887540333c8cc7	4.6 MB	Preview Download
Bimetallic.zip md5:6591402e8c2dbc07d7c30bbd7360d6f7	1.2 GB	Preview Download
Dataset_Eads_H.json md5:e43f178eacd47ea60628d9a5b25849e9	96.6 kB	Preview Download
Dataset_Eads_H_scaled.xlsx md5:5e8f34f8faa2c12e932eb2d62018e29c	92.8 kB	Download
Dataset_Eads_O-OH.json md5:c3d20f8399b9fd634f87316f92d1efdf	179.5 kB	Preview Download
Dataset_Eads_O-OH_scaled.xlsx md5:9446b3e7e280da4c2e6a5ef77abbea09	118.8 kB	Download
Pure_metals.zip md5:2b3e5ab288523ee3c03eed155e1ad97d	157.9 MB	Preview Download
QuantumEspresso_versions md5:a346da3304802339d5a9c5bdb6c018f5	630 Bytes	Download
Screening_RF_H.json md5:074196f9ae4cb4fc7f8ca2099b05b34f	287.0 kB	Preview Download
Screening_RF_O-OH.json md5:e71989ef5f8788c258a6147b3c012a41	579.0 kB	Preview Download
supporting.pdf md5:ff3aec3bc24f231218161f41a7cb3b92	848.7 kB	Preview Download

Additional details

European Commission
HighHydrogenML - High-throughput Discovery of Catalysts for the Hydrogen Economy through Machine Learning 101105610

Repository URL: https://github.com/vvassilevg/HighHydrogenML.git
Programming language: Python
Development Status: Active

	All versions	This version
Views	392	246
Downloads	687	566
Data volume	89.9 GB	64.9 GB

EU Open Research Repository

EU Open Research Repository

AcceptedManuscript_CatalSciTechnol2024.pdf

Files (1.4 GB)

Funding

Software

About

Submission

EU Open Research Repository

EU Open Research Repository

Research and Innovation

Datasets of DFT adsorption energies of H and for O and OH on different pure metals and binary intermetallic compounds considering the application of elastic strains and lists of candidates for screening

Authors/Creators

Description

Files

AcceptedManuscript_CatalSciTechnol2024.pdf

Files (1.4 GB)

Additional details

Funding

Software