Published January 31, 2025 | Version v1
Dataset Open

DFT data for "Molecular Simulations with a Pretrained Neural Network and Universal Pairwise Force Fields"

Description

This repository contains quantum mechanical datasets used for training and testing the SO3LR machine-learned force field model in Molecular Simulations with a Pretrained Neural Network and Universal Pairwise Force Fields.

The combined training set, provided in extxyz format and compressed as so3lr_train.tar.gz, includes GEMS general protein fragments, QM7-X small organic molecules, AQM large drug-like molecules, SPICE dipeptides, and DES15k dimers (see the "T – Optimization on diverse training data" subsection of the original manuscript for details). To maintain consistent references, the last two datasets were recomputed at the PBE0+MBD/tight level of theory (see FHI-aims input file control.in).

Test sets from Table I of the original manuscript are available in so3lr_test.tar.gz. Structures from the MD22 and TorsionNet500 datasets were also recomputed at the PBE0+MBD/tight level of theory.

Files

README.txt

Files (3.6 GB)

Name Size Download all
md5:99b347a2448a184adcf303a16ccc9299
25.5 kB Download
md5:1c1ef413ac8c7cde5e3bbba80848837d
106.9 MB Download
md5:3cdc27d4f9a41b285389d39a9d97d47a
1.9 kB Download
md5:411cfdb74f06f6e3838fcb1e0d0e2933
901 Bytes Preview Download
md5:5b6c87fd316900dce3b419a0dad13341
74.8 MB Download
md5:5f485509021f38cc709cfaa768241b1b
2.9 GB Download
md5:21d5e78b5652970adc79b5264dd08b4b
399.7 MB Download
md5:3a0b55570f305bed31c78382116a7d80
108.2 MB Download