There is a newer version of the record available.

Published January 25, 2023 | Version v1.0.0
Dataset Open

Supplementary data for: Tackling hysteresis in conformational sampling --- how to be forgetful with MEMENTO

  • 1. Department of Biochemistry, University of Oxford

Description

This is the supplementary data for our publication entitled: "Tackling hysteresis in conformational sampling --- how to be forgetful with MEMENTO".

Abstract:

<<<

The structure of proteins has long been recognised to hold the key to understanding and engineering their function, and rapid advances in structural biology (and protein structure prediction) are now supplying researchers with an ever-increasing wealth of structural information. Most of the time, however, structures can only be determined in free energy minima, one at a time. While conformational flexibility may thus be inferred from static end-state structures, their interconversion mechanisms — a central ambition of structural biology — are often beyond the scope of direct experimentation. Given the dynamical nature of the processes in question, many studies have attempted to explore conformational transitions using molecular dynamics (MD). However, ensuring proper convergence and reversibility in the predicted transitions is extremely challenging. In particular, a commonly used technique to map out a path from a starting to a target conformation called targeted MD (tMD) can suffer from starting-state dependence (hysteresis) when combined with techniques such as umbrella sampling (US) to compute the free energy profile of a transition.

Here, we study this problem in detail on conformational changes of increasing complexity. We also present a new, history-independent approach that we term “MEMENTO” (Morphing End states by Modelling Ensembles with iNdependent TOpologies) to generate paths that alleviate hysteresis in the construction of conformational free energy profiles. MEMENTO utilises template-based structure modelling to restore physically reasonable protein conformations based on coordinate interpolation (morphing) as an ensemble of plausible intermediates, from which a smooth path is picked. We compare tMD and MEMENTO on well-characterized test cases (the toy peptide deca-alanine and the enzyme adenylate kinase) before discussing its use in more complicated systems (the kinase P38α and the bacterial leucine transporter LeuT). Our work shows that for all but the simplest systems tMD paths should not in general be used to seed umbrella sampling or related techniques, unless the paths are validated by consistent results from biased runs in opposite directions. MEMENTO, on the other hand performs well as a flexible tool to generate intermediate structures for umbrella sampling. We also demonstrate that extended end-state sampling combined with MEMENTO can aid the discovery of collective variables on a case-by-case basis.

>>>

There are 4 folders in this dataset for the 4 simulation systems studied in the paper. Deca-alanine, ADK, P38a and LEUT. Each folder contains key coordinate files used to initialise MEMENTO and tMD runs, and data produced in the course of simulations (as labelled in the folders). In the simulation folders, 'raw_data' corresponds to the outputs produced by PLUMED during REUS or tMD, 'convergence' contains WHAM outputs taken at a certain percentage of the data included, and 'pointsfile' are the metadata files required by the Grossfield WHAM implementation.

Notes

Funding from the Wellcome Trust (218514/Z/19/Z), and compute resources from HECBioSim supported by EPSRC (EP/L000253).

Files

Files (868.1 MB)

Name Size Download all
md5:05513c7e1157c1d4f7c694a37a298f57
868.1 MB Download

Additional details

Funding

UK Research and Innovation
The UK High-End Computing Consortium for Biomolecular Simulation EP/L000253/1