Published September 20, 2024 | Version v1
Dataset Open

RNA large language models embeddings on benchmark datasets

  • 1. Universidad Nacional del Litoral
  • 2. Consejo Nacional de Investigaciones Científicas y Técnicas
  • 3. Consejo Nacional de Investigaciones Cientificas y Tecnicas

Description

This repository contains pre-computed embeddings for several RNA sequences, using most recent Large Language Models (LLM) pre-trained on RNA sequences. 

compressed files for each combination of RNA-LLM models and benchmarking RNA datasets. 

Files

Files (50.0 GB)

Name Size Download all
md5:2bc520e96382ece9ebdfefb3ff383efd
2.1 GB Download
md5:cd71b62babcb8cbc864b797171fd0caa
6.6 GB Download
md5:0665d160e0ca40e9addfd87c73917ea1
43.9 MB Download
md5:34bb70954278e7dcd8aa014abac26a34
471.9 kB Download
md5:14ff6cadf79bab3eac15a5e5eaa3fff1
1.6 MB Download
md5:e96ec01428d0e5c2d9b38bc6eae86398
13.0 kB Download
md5:69c4fabe0a312066886eaadb707ab481
3.5 GB Download
md5:f6b135b78fe0ba9f8ef2f566dabe5daf
11.2 GB Download
md5:36c04c06e7c37fe45ec98fa370d6a463
73.4 MB Download
md5:f3c5bb65bfd6879effdba856ad57dfa6
2.1 GB Download
md5:c71962cbb085a909356d7f841343768f
6.7 GB Download
md5:d832f4eec132201528896f38c52167f3
44.2 MB Download
md5:a176bf3b19afe248f672d51de47ec340
326.5 MB Download
md5:e4ab39f3c97361042ab2e1dc292f1722
1.1 GB Download
md5:45b960046b7879997b697f95b35a3207
6.9 MB Download
md5:ce0a1e020c3401db422cd54e14bcdb72
2.1 GB Download
md5:73cf138a8f9e16d473f84b5ec456d35f
6.6 GB Download
md5:1d6a9beb2a2ca508e12b401c016e6394
44.1 MB Download
md5:13fd77f9f42309b289f3d9a52e1d0b0e
1.8 GB Download
md5:6206884827be16e7863aaaa017c62f46
5.6 GB Download
md5:bbe709a44df74f7931f15b3125d5d112
36.9 MB Download

Additional details