Published March 29, 2023 | Version v1.0.1
Software Open

Protein Function Embeddings: First Beta Release

  • 1. University of Bonn
  • 2. ZB MED- Information Centre for Life Sciences
  • 1. Bonn-Aachen International Centre for Information Technology (B-IT), University of Bonn
  • 2. University of Cologne

Description

This release corresponds to a thesis work that explores how information for protein functions can be exploited through embeddings so that the produced information can be used to improve protein function annotations. The underlying hypothesis here is that any pair of proteins with high sequence similarity will also share a similar biological function which would be reflected by the corresponding protein embeddings. The comparison and evaluation of this is done using two text-driven embedding approaches: Word2doc2Vec and Hybrid-Word2doc2Vec.

Files

protein-function-embeddings-thesis-1.0.1.zip

Files (86.7 kB)

Name Size Download all
md5:40df99f449979ad7401c23f9dc7018cd
36.1 kB Download
md5:ba8e4f517f1d96f93add0159a73f6465
50.6 kB Preview Download