There is a newer version of the record available.

Published December 30, 2021 | Version V0.9
Dataset Open

Neural Networks for Structure-Informed Prediction of Formation Energy (employed in SIPFENN)

  • 1. The Pennsylvania State University

Description

Work Abstract:

In recent years, numerous studies have employed machine learning (ML) techniques to enable orders of magnitude faster high-throughput materials discovery by augmentation of existing methods or as standalone tools. In this paper, we introduce a new neural network-based tool for the prediction of formation energies based on elemental and structural features of Voronoi-tessellated materials. We provide a self-contained overview of the ML techniques used. Of particular importance is the connection between the ML and the true material-property relationship, how to improve the generalization accuracy by reducing overfitting, and how new data can be incorporated into the model to tune it to a specific material system.
    
    In the course of this work, over 30 novel neural network architectures were designed and tested. This lead to three final models optimized for (1) highest test accuracy on the Open Quantum Materials Database (OQMD), (2) performance in the discovery of new materials, and (3) performance at a low computational cost. On a test set of 21,800 compounds randomly selected from OQMD, they achieve mean average error (MAE) of 28, 40, and 42 meV/atom respectively. The second model provides better predictions on materials far from ones reported in OQMD, while the third reduces the computational cost by a factor of 8.
    
    We collect our results in a new open-source tool called SIPFENN (Structure-Informed Prediction of Formation Energy using Neural Networks). SIPFENN not only improves the accuracy beyond existing models but also ships in a ready-to-use form with pre-trained neural networks and a user interface. 

Files

NN20 R720 B2048_traininedNetMXnet.json

Files (5.4 GB)

Name Size Download all
md5:7e730d29ebb95d826fa25cce1d56359c
1.3 GB Download
md5:bf928f6a09eea6107dcaab2fde910307
3.4 kB Preview Download
md5:87990b144c6bb521825f82e2b384d268
1.3 GB Download
md5:8714c1f3819e5eb7762b449b3bf4b319
148.8 MB Download
md5:8ad0e4b7984fd5ef9291e6040efd8a9b
3.0 kB Preview Download
md5:e2e05eada22e2707ae708e6e23ffe27f
148.8 MB Download
md5:1baeeff4d3b5210d9d51b8a77e8f29f4
1.3 GB Download
md5:25d95a84aa6045ac99e1a1ed6c8a4903
2.6 kB Preview Download
md5:3f484bcc9307980c439992c309e1f189
1.3 GB Download
md5:7707f6863927b59bfca6271d5fe42812
46.6 MB Preview Download
md5:9850763b84bc4af1e8785c3a7ad4d6b3
9.7 MB Preview Download

Additional details

Related works

Is derived from
Preprint: arXiv:2008.13654 (arXiv)