Published January 12, 2023 | Version v3
Dataset Open

PIsToN: Evaluating Protein Binding Interfaces with Transformer Networks (dataset)

  • 1. Florida International University

Description

Computational protein-binding studies are widely used to investigate fundamental biological processes and facilitate the development of modern drugs, vaccines, and therapeutics. Scoring functions aim to assess and rank the binding strength of the predicted protein complex. Accurate scoring of protein binding interfaces remains a challenge. PIsToN (evaluating Protein binding Interfaces with Transformer Networks) represents a novel approach to distinguish native-like protein complexes from incorrect conformations. Protein interfaces are transformed into a collection of 2D images (interface maps), each corresponding to a geometric or biochemical property. Pixel intensities represent the feature values. A neural network was adapted from a popular vision transformer (ViT) with several enhancements: a hybrid component to accept empirical-based energy terms, a multi-attention module to highlight essential features and binding sites, and the use of contrastive learning for better ranking performance. The resulting PIsToN model significantly outperforms state-of-the-art scoring functions on well-known datasets.

This repository contains proteins and pre-computed interface maps for the PIsToN work.

Files

final_val.txt

Files (4.2 GB)

Name Size Download all
md5:b1d2c8468dcaf340d981806a54463d45
2.7 GB Download
md5:0b9978a44f9f2f1086a35358b55020a6
1.6 kB Preview Download
md5:26a048fe10586570ba2ef585752f7843
364.5 MB Download
md5:ccf340331efc6fdcb42dacd5cf83209e
6.2 kB Preview Download
md5:b3e00bc438703106c3b865b0611436db
1.1 GB Download
md5:d8127cb8e970bf423f6e05cf7c23532c
14.8 kB Preview Download
md5:fc4d18d1f48c7888bf665a4e8032fa24
14.3 kB Preview Download
md5:18a74960d3cfe12e41079b9eecfec4cc
12.5 kB Preview Download
md5:2339b434a42a05f7380ab8f16e5354c8
45.1 kB Preview Download

Additional details

Related works

Cites
Dataset: 10.5281/zenodo.2625420 (DOI)
Journal article: 10.1038/s41592-019-0666-6 (DOI)
Journal article: 10.1002/prot.24678 (DOI)
Journal article: 10.1038/s41467-021-27396-0 (DOI)
Is published in
Preprint: 10.1101/2023.01.03.522623 (DOI)