A "short blanket" dilemma for a state-of-the-art neural network potential for water: Reproducing experimental properties or the underlying many-body physics?

Zhai, Yaoguang; Caruso, Alessandro; Bore, Sigbjørn Løland; Luo, Zhishang; Paesani, Francesco

doi:10.5281/zenodo.7577034

Published January 27, 2023 | Version 1.0

Dataset | Open

1. University of California, San Diego

Deep neural network (DNN) potentials have recently gained popularity in computer simulations of a wide range of molecular systems, from liquids to materials.
In this study, we explore the possibility of combining the computational efficiency of the DeePMD framework and the demonstrated accuracy of the MB-pol data-driven many-body potential to train a DNN potential for large-scale simulations of water across its phase diagram.
We find that the DNN potential is able to reliably reproduce the MB-pol results for liquid water but provides a less accurate description of the vapor-liquid equilibrium properties.
This shortcoming is traced back to the inability of the DNN potential to correctly represent many-body interactions.
An attempt to explicitly include information about many-body effects results in a new DNN potential that exhibits the opposite performance, being able to correctly reproduce the MB-pol vapor-liquid equilibrium properties but losing accuracy in the description of the liquid properties.
These results suggest that DeePMD-based DNN potentials are not able to correctly "learn" and, consequently, represent many-body interactions, which implies that DNN potentials may have limited ability to predict properties for state points that are not explicitly included in the training process.
The computational efficiency of the DeePMD framework can still be exploited to train DNN potentials on data-driven many-body potentials, which can thus enable large-scale, "chemically accurate" simulations of various molecular systems, with the caveat that the target state points must have been adequately sampled by the reference data-driven many-body potential in order to guarantee a faithful representation of the associated properties.

Notes

We thank Maria Muniz, Athanassios Panagiotopoulos, and Vinicius Cruzeiro for stimulating discussions at the early stage of this research. This research was supported by the Air Force Office of Scientific Research under award FA9550-20-1-0351 and used computational resources of the Department of Defense High Performance Computing Modernization Program (HPCMP) as well as the Triton Shared Computing Cluster (TSCC) at the San Diego Supercomputer Center (SDSC). This work was supported in part by the UC Southern California Hub, with funding from the UC National Laboratories division of the University of California Office of the President.

Files

Files (2.6 GB)

Name	MD5	Size
DeePMD_Dataset_final.tar.bz2	0449e95f66abb2609113e981f514034d	2.3 GB
DeePMD_training_inputs.zip	4f65d14315b3b6decfee3f2b37b9112f	4.2 kB
frozen_potentials.tar.bz2	43ac3f3c893f55f7cd39c404b3022105	258.6 MB
LAMMPS_inputs.zip	90e0d62e4e2da2802af1d59c57dc8036	56.0 kB
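
Because the largest archive is over 2 GB, it may be worth checking each download against the MD5 checksums listed with the files above before unpacking. The following is a minimal sketch, not part of the deposited data: the function names `md5_of_file` and `verify_downloads` are hypothetical, and it assumes the four files were saved under their original names in a single directory.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "DeePMD_Dataset_final.tar.bz2": "0449e95f66abb2609113e981f514034d",
    "DeePMD_training_inputs.zip": "4f65d14315b3b6decfee3f2b37b9112f",
    "frozen_potentials.tar.bz2": "43ac3f3c893f55f7cd39c404b3022105",
    "LAMMPS_inputs.zip": "90e0d62e4e2da2802af1d59c57dc8036",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True (match), False (mismatch), or None (missing).

    Hypothetical helper: assumes the files keep their original names.
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = None
        else:
            results[name] = md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    for name, ok in verify_downloads().items():
        status = "missing" if ok is None else ("OK" if ok else "MISMATCH")
        print(f"{name}: {status}")
```

Run from the download directory, the script prints one status line per file. A `MISMATCH` most likely indicates an incomplete or corrupted download, in which case the file should be fetched again.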

Additional details

Is supplement to: Preprint: 10.26434/chemrxiv-2022-t92nd-v4 (DOI)

	All versions	This version
Views	390	389
Downloads	311	309
Data volume	1.8 TB	1.8 TB
