PIsToN: Evaluating Protein Binding Interfaces with Transformer Networks (dataset)
Authors/Creators
- 1. Florida International University
Description
Computational protein-binding studies are widely used to investigate fundamental biological processes and facilitate the development of modern drugs, vaccines, and therapeutics. Scoring functions aim to assess and rank the binding strength of the predicted protein complex. Accurate scoring of protein binding interfaces remains a challenge. PIsToN (evaluating Protein binding Interfaces with Transformer Networks) represents a novel approach to distinguish native-like protein complexes from incorrect conformations. Protein interfaces are transformed into a collection of 2D images (interface maps), each corresponding to a geometric or biochemical property. Pixel intensities represent the feature values. A neural network was adapted from a popular vision transformer (ViT) with several enhancements: a hybrid component to accept empirical-based energy terms, a multi-attention module to highlight essential features and binding sites, and the use of contrastive learning for better ranking performance. The resulting PIsToN model significantly outperforms state-of-the-art scoring functions on well-known datasets.
This repository contains proteins and pre-computed interface maps for the PIsToN work.
Files
final_val.txt
Files
(4.2 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:b1d2c8468dcaf340d981806a54463d45
|
2.7 GB | Download |
|
md5:0b9978a44f9f2f1086a35358b55020a6
|
1.6 kB | Preview Download |
|
md5:26a048fe10586570ba2ef585752f7843
|
364.5 MB | Download |
|
md5:ccf340331efc6fdcb42dacd5cf83209e
|
6.2 kB | Preview Download |
|
md5:b3e00bc438703106c3b865b0611436db
|
1.1 GB | Download |
|
md5:d8127cb8e970bf423f6e05cf7c23532c
|
14.8 kB | Preview Download |
|
md5:fc4d18d1f48c7888bf665a4e8032fa24
|
14.3 kB | Preview Download |
|
md5:18a74960d3cfe12e41079b9eecfec4cc
|
12.5 kB | Preview Download |
|
md5:2339b434a42a05f7380ab8f16e5354c8
|
45.1 kB | Preview Download |
Additional details
Related works
- Cites
- Dataset: 10.5281/zenodo.2625420 (DOI)
- Journal article: 10.1038/s41592-019-0666-6 (DOI)
- Journal article: 10.1002/prot.24678 (DOI)
- Journal article: 10.1038/s41467-021-27396-0 (DOI)
- Is published in
- Preprint: 10.1101/2023.01.03.522623 (DOI)