There is a newer version of this record available.

Dataset Open Access

Data and Weights for Reverse Homology

Alex X Lu; Amy X Lu; Iva Pritišanac; Taraneh Zarin; Julie D Forman-Kay; Alan M Moses

Training data, weights, and classification datasets for "Discovering molecular features of intrinsically disordered regions by using evolution for contrastive learning". 

Training data:

  • scer_idr_homologues and human_idr_homologues contain a zip file of the fasta files of IDR homologues used to train the yeast and human model, respectively. Note that these fasta files are aligned, but we strip away the alignment symbol "-" before input into our model. 

Weights:

  • scer_idr_model and human_idr_model contain a zip file of the weights for the yeast and human model respectively, which can be loaded into the model files at github.com/alexxijielu/reverse_homology. It also contains z-scores of all of the model features across all IDRs, which are required to run the mutational scanning map code. 

Classification datasets:

  • IDR_classification_datasets contains datasets used in our benchmarks. These datasets are encoded as binary csv matrixes. cdc28_classification contains IDRs labeled as Cdc28 phosphorylation sites, mitochondrial_targeting_classification contains IDRs labeled as mitochondrial targeting signals, evosig_cluster_classification contains IDRs labeled by clusters assigned in previous computational work by Zarin et al. eLife 2019, and go_SLIM_classification contains proteins labeled by GO Slim annotations. 
Files (156.3 MB)
Name Size
human_idr_homologues.zip
md5:45515d51eacb232c5ccb901cebb07b61
86.5 MB Download
human_idr_model.zip
md5:7a8ef3f724c9dc1d165dee8394690b3d
43.1 MB Download
IDR_classification_datasets.zip
md5:42b078e818b4deeb02afe8dcbb6f6df1
167.7 kB Download
scer_idr_homologues.zip
md5:0964280f9faaaa9c71f2390dd08631d2
6.6 MB Download
scer_idr_model.zip
md5:378ad0cb8433af659b98803ab241a4a0
19.9 MB Download
143
83
views
downloads
All versions This version
Views 143101
Downloads 8358
Data volume 3.0 GB2.2 GB
Unique views 11894
Unique downloads 3324

Share

Cite as