Published June 18, 2024 | Version 1.0
Dataset Open

AdsMT: Multi-modal Transformer for Predicting Global Minimum Adsorption Energy

Description

We built three Global Minimum Adsorption Energy (GMAE) benchmark datasets, named OCD-GMAE, Alloy-GMAE, and FG-GMAE, from the OC20-Dense, Catalysis Hub, and "functional groups" (FG) datasets, respectively, through strict data cleaning. Each data point represents a unique combination of catalyst surface and adsorbate. These new benchmark datasets are intended to support future ML studies on GMAE prediction.

In addition, a similar data cleaning procedure was employed on the OC20 dataset to create a new dataset named OC20-LMAE, which comprises surface/adsorbate pairings along with their local minimum adsorption energies (LMAE). The OC20-LMAE dataset contains 363,937 data points and serves as an effective resource for model pretraining.

Files

Files (322.0 MB)

Checksum Size
md5:3c4eb0d619e07750756458346ee4f975 4.7 MB
md5:e58ad52ff537361c4dfdbda06769bfd9 3.0 MB
md5:13bd4592b82f30fe3a7db6599afa6dff 312.9 MB
md5:65ad65b7d5ba15d5604e3c6c9a39cc10 1.5 MB
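
The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `verify_md5` and the file path in the usage note are illustrative assumptions; this record does not state the file names, so substitute the actual name of the file you downloaded.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use low for large archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's MD5 digest with an expected checksum string.

    Accepts the checksum with or without the leading "md5:" label
    used in the file listing.
    """
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `verify_md5("downloaded_file", "md5:13bd4592b82f30fe3a7db6599afa6dff")` returns `True` only if the local file (here a hypothetical path) matches the checksum of the 312.9 MB entry. Note that MD5 is suitable for detecting accidental corruption but not deliberate tampering.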

Additional details

Funding

Swiss National Science Foundation
NCCR Catalysis (phase I) 180544