There is a newer version of the record available.

Published October 26, 2022 | Version 1.1
Dataset Open

Computed surface and chemical potentials, expansion coefficients, structures, models and results for the PMFPredictor Toolkit

  • 1. University College Dublin

Description

The PMFPredictor toolkit enables the prediction of the potentials of mean force describing the interaction between a surface and a small molecule in aqueous solution, which would otherwise be obtained from lengthy metadynamics simulations. This repository contains files to enable the operation of the toolkit, with source code available at https://github.com/ijrouse/PMFPredictor-Toolkit and corresponding to release v0.5-alpha.

In PMFPredictor-Repository.zip we provide supplementary data necessary for the operation of the PMFPredictor Toolkit including:

  • Structures of surfaces ("Structures/Surfaces") and chemicals ("Structures/Chemicals") in a united tabulated (.csv) format, listing x/y/z co-ordinates, atom IDs, mass (in amu), charge (in elementary units), Lennard Jones 6-12 parameters: sigma (in nm) and epsilon (in kJ/mol).
  • Interaction potentials of surfaces ("SurfacePotentials") and chemicals ("ChemicalPotentials") with probe atoms and molecules in tabulated format with distances relative to reference points in nm and energies in kJ/mol. Also included in these folders are the potentials with the molecular probes in individual files.
  • Hypergeometric expansion coefficients of the interaction potentials ("Datasets/SurfacePotentialCoefficientsNoise-1-oct12.csv" and "Datasets/ChemicalPotentialCoefficients-oct10.csv") in tabulated form, corresponding to potentials with units of nm for distance and kJ/mol for energy. Descriptions of the headers are provided in DatasetHeaderDescription.txt, included in the archive.
  • Trained TensorFlow models for the prediction of potentials of mean force from HG interaction coefficients, suitable for loading via the Keras backend.
  • PMFs generated for a range of surfaces and chemicals as output from the trained model, in both text format and figures showing comparisons to training PMFs where available. PMFs are supplied as tabulated data with comma separated values of distance in nm and interaction energies in kJ/mol.
  • Adsorption energies in kJ/mol evaluated at T=300K extracted from all PMFs and compared to the values obtained from known PMFs where available.

The surface_pmfpredictor.zip  archive contains PMFs selected for the operation of the UnitedAtom software package for the calculation of protein-nanoparticle interactions. This data is included in the main repository file and provided separately to avoid the download of unnecessary data if only the final PMFs are required. As with the main set, these are provided in tabulated form with distance [nm], energy [kJ/mol] pairs. This repository also contains the sets of figures illustrating these PMFs for each surface. Both archives contain further information on the contents, including descriptions of the surfaces and chemicals for which PMFs are computed. We also supply the training data used to build the model in a separate archive, PMFPredictor-TrainingData.zip, along with a text file containing descriptions of all headers in this file. This training data is quite large when uncompressed, c.a. 7 Gb, hence its exclusion from the main archive.

If you use results from this repository please cite the following paper in addition to the repository itself:

I. Rouse, V. Lobaskin, Machine-learning based prediction of small molecule -- surface interaction potentials,      arXiv:2211.07999
https://arxiv.org/abs/2211.07999

Notes

Updated metadata contained in repository to aid in re-use and added training data. Numerical values and models are unchanged from v1.0

Files

PMFPredictor-Repository.zip

Files (7.3 GB)

Name Size Download all
md5:d9f8480de3b1bff4c6201399b4f7467a
7.1 GB Preview Download
md5:2ca8608975c45380d02cab6cdbebad8b
183.9 MB Preview Download
md5:5da06cdd5812800ae4eff466e5d71650
106.9 MB Preview Download

Additional details

Funding

European Commission
NanoSolveIT - Innovative Nanoinformatics models and tools: towards a Solid, verified and Integrated Approach to Predictive (eco)Toxicology (NanoSolveIT) 814572