Published December 9, 2019 | Version 3.0
Software Open

DROIDSv3.0/maxDemonv1.0 - software for machine learning applied to comparative protein dynamics

  • 1. Rochester of Institute of Technology

Contributors

Project member:

  • 1. Rochester Institute of Technology

Description

The application of statistical methods to comparatively framed questions about the molecular dynamics (MD) of proteins can potentially enable investigations of biomolecular function beyond the current sequence and structural methods in bioinformatics. However, the chaotic behavior in single MD trajectories requires statistical inference that is derived from large ensembles of simulations representing the comparative functional states of a protein under investigation. Meaningful interpretation of such complex forms of big data poses serious challenges to users of MD. Here, we announce DROIDS 3.0, a method and software package for comparative protein dynamics that includes maxDemon 1.0, a multi-method machine learning application that trains on large ensemble comparisons of concerted protein motions in opposing functional states generated by DROIDS, and deploys learned classifications of these states onto newly generated MD simulations. Local canonical correlations in learning patterns generated from independent, yet identically prepared, MD validation runs are used to identify regions of functionally conserved protein dynamics. The subsequent impacts of genetic and/or drug class variants on conserved dynamics can also be analyzed by deploying the classifiers on variant MD simulations and quantifying how often these altered protein systems display opposing functional states. Here, we present several case studies of complex changes in functional protein dynamics caused by temperature, genetic mutation, and binding interactions with nucleic acids and small molecules. We demonstrate that our machine learning algorithm can properly identify regions of functionally conserved dynamics in ubiquitin and TATA binding protein (TBP). We quantify the impact of genetic variation in TBP and drug class variation targeting the ATP binding region of Hsp90 on conserved dynamics. We identify regions of conserved dynamics in Hsp90 that connect the ATP binding pocket to other functional regions. We also demonstrate that dynamic impacts of various Hsp90 inhibitors rank accordingly with how closely they mimic natural ATP binding.

Notes

also posted at GitHub https://github.com/gbabbitt/DROIDS-3.0-comparative-protein-dynamics

Files

citation1.pdf

Files (21.7 MB)

Name Size Download all
md5:6f993dfad407880518bdab0a6a4a73b9
2.2 MB Preview Download
md5:d1899e9b3fbcfeabf053548553b6cb15
2.1 MB Preview Download
md5:3e4358bdbcbdcca244f1a2e20e4174aa
10.7 MB Preview Download
md5:a2a5d8fecea7e4a32744759d1c2dd258
5.1 MB Preview Download
md5:51fc62258bd72ddc8d5986f3a4ac10c2
1.6 MB Preview Download

Additional details