Published 2023 | Version v5

Functional annotation of enzyme-encoding genes using deep learning with transformer layers

Description

Supplementary datasets for the paper "Improved annotation of enzyme-encoding genes using deep learning with transformer layers".

The datasets include Supplementary Data 1 (Predicted EC numbers for protein sequences from Swiss-Prot database using DeepECtransformer), Supplementary Data 2 (Visualization of the latent representations of enzyme sequences in the Swiss-Prot database using TMAP), Supplementary Data 3 (Commonly highlighted motifs for each EC number using DeepEC v2 neural network), Supplementary Data 4 (Sequences for each of strain specific alleles), Supplementary Data 5 (EC number prediction results for the y-ome proteins), Supplementary Data 6 (EC numbers of 128,100,490 protein sequences in 70,600 genomes in NCBI), and Supplementary Data 7 (Solubility prediction results for 295 y-ome proteins).

Files

Kim_etal_Supplementary_Data_3.zip

Files (694.1 MB)

Name Size
md5:bb9e9999658843079a20a86024f09f54
4.8 MB Download
md5:d19ce25fb475537adf47b604e16c9e16
51.3 MB Download
md5:07b471f6bab659e561d8a1b4b6e6d308
83.7 MB Preview Download
md5:2d164e5c7cc02f744a8da9005add8e8c
15.5 MB Download
md5:00d626dcab9af70389eb62e57a9443eb
42.6 kB Download
md5:b8d88316700ffadf33c49be4e612dd96
538.7 MB Download
md5:59c2cf822ac23384473e3f1a392aa665
34.2 kB Download