Functional annotation of enzyme-encoding genes using deep learning with transformer layers
Description
Supplementary datasets for the paper "Improved annotation of enzyme-encoding genes using deep learning with transformer layers".
The datasets include Supplementary Data 1 (predicted EC numbers for protein sequences from the Swiss-Prot database using DeepECtransformer), Supplementary Data 2 (visualization of the latent representations of enzyme sequences in the Swiss-Prot database using TMAP), Supplementary Data 3 (commonly highlighted motifs for each EC number using the DeepEC v2 neural network), Supplementary Data 4 (sequences of each strain-specific allele), Supplementary Data 5 (EC number prediction results for the y-ome proteins), Supplementary Data 6 (EC numbers of 128,100,490 protein sequences in 70,600 genomes in NCBI), and Supplementary Data 7 (solubility prediction results for 295 y-ome proteins).
Files
Previewed file: Kim_etal_Supplementary_Data_3.zip
Total size of all files: 694.1 MB
| MD5 checksum | Size |
|---|---|
| md5:bb9e9999658843079a20a86024f09f54 | 4.8 MB |
| md5:d19ce25fb475537adf47b604e16c9e16 | 51.3 MB |
| md5:07b471f6bab659e561d8a1b4b6e6d308 | 83.7 MB |
| md5:2d164e5c7cc02f744a8da9005add8e8c | 15.5 MB |
| md5:00d626dcab9af70389eb62e57a9443eb | 42.6 kB |
| md5:b8d88316700ffadf33c49be4e612dd96 | 538.7 MB |
| md5:59c2cf822ac23384473e3f1a392aa665 | 34.2 kB |
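
The MD5 checksums listed for each file can be used to confirm that a download completed without corruption. The following is a minimal sketch using only the Python standard library. The function names `md5_of_file` and `matches_listed_checksum` and the local file path in the usage comment are hypothetical and are not part of the dataset.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a local file against a checksum written as 'md5:<hex digest>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Hypothetical usage: the local path is an assumption, and the checksum must be
# the one listed for the file that was actually downloaded.
# ok = matches_listed_checksum("downloaded_file.zip",
#                              "md5:07b471f6bab659e561d8a1b4b6e6d308")
```

A mismatch usually means the download was truncated or altered, in which case the file should be fetched again before use.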