Published November 15, 2022 | Version Necessary customed database
Dataset Open

Prediction of inter-chain distance maps of protein complexes with 2D attention-based deep neural networks

Description

Residue-residue distance information is useful for predicting tertiary structures of protein monomers or quaternary structures of protein complexes. Many deep learning methods have been developed to predict intra-chain residue-residue distances of monomers accurately, but few methods can accurately predict inter-chain residue-residue distances of complexes. We develop a deep learning method CDPred (i.e., Complex Distance Prediction) based on the 2D attention-powered residual network to address the gap. Tested on two homodimer datasets, CDPred achieves the precision of 60.94% and 42.93% for top L/5 inter-chain contact predictions (L: length of the monomer in homodimer), respectively, substantially higher than DeepHomo’s 37.40% and 23.08% and GLINTER’s 48.09% and 36.74%. Tested on the two heterodimer datasets, the top Ls/5 inter-chain contact prediction precision (Ls: length of the shorter monomer in heterodimer) of CDPred is 47.59% and 22.87% respectively, surpassing GLINTER’s 23.24% and 13.49%. Moreover, the prediction of CDPred is complementary with that of AlphaFold2-multimer.

Files

Files (40.2 GB)

Name Size Download all
md5:24789f52791b510e463d74ba68f2106f
1.5 MB Download
md5:b82deac4ae99ea38ff6a1257794a699a
40.2 GB Download

Additional details

Funding

U.S. National Science Foundation
III: Medium: Collaborative Research: Guiding Exploration of Protein Structure Spaces with Deep Learning 1763246
U.S. National Science Foundation
ABI Innovation: Deep learning methods for protein bioinformatics 1759934