Published September 20, 2024 | Version v1
Dataset Open

Common function paralog pairs and sequence similarity features

  • 1. University College Dublin

Description

This repository contains datasets of selected paralog pairs (Ensembl release 111) labeled with several "common function" annotations, including protein-protein interaction (PPI), synthetic lethality (SL), and Gene Ontology (GO) datasets, for both human (Homo sapiens) and budding yeast (Saccharomyces cerevisiae). Each paralog pair is characterized by a set of sequence similarity features, including features derived from AlphaFold-predicted structures, protein language model embeddings, and similarity searches against various databases.

Files

ens111_human_allFeatures.csv

Files (563.5 MB)

Checksum Size
md5:87c68a7b6336d989abd4b5b3c76a518e 202.3 MB
md5:d2d354b304c815a0dfc58372333da5b0 197.1 MB
md5:71c5a8c38c79a33e0f1ee3ea8a6f48af 53.4 MB
md5:b0aaa8e49b18218b40bd43735ce92492 20.5 MB
md5:208542c70ea4e589c0d22abb9c086f36 27.9 MB
md5:8081557a4fd1ebdbac0a0c7bffd8be93 49.3 MB
md5:41d2c3305d7f12fd823a4fbf5a9197ec 4.5 MB
md5:fea113830f3d90cbd8a767619c0d4f5d 604.2 kB
md5:d817f29f47e745b742354e5f97a3fd22 673.5 kB
md5:c720c09b6a4885d8a86d58119a9db20a 4.3 MB
md5:e3f7e9622b1995963bba6e6a5cec1b7a 672.9 kB
md5:d2bb3d39349c46d8a509e46495a135f9 2.2 MB
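
Each file in the list is published with an MD5 checksum, so a download can be checked for corruption before use. The following is a minimal sketch of such a check using only the Python standard library. The helper names md5_of_file and matches_listed_md5 are hypothetical and not part of this record. The listing does not say which checksum belongs to which file name, so that pairing has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a local file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, matches_listed_md5("ens111_human_allFeatures.csv", listed_value) returns True only if the local copy is byte-identical to the published file, where listed_value is the checksum shown for that file on the record page (the local path here is an assumption about where the file was saved).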

Additional details

Funding

Science Foundation Ireland

Dates

Available
2024-09-25