Published January 21, 2022 | Version 1
Dataset Open

Predicting Exon Criticality from Protein Sequence

  • 1. Wave Life Science
  • 2. Wave Life Sciences

Description

Exon ByPASS (predicting Exon-skipping Based in Protein amino acid SequenceS), predictions on test exons from Human and Mouse transcripts. The exons in the test set from the two genomes are those that are not predicted to be skippable in hg38 and mm10 annotation and are also exons that in-frame when skipped. The preprocessed data includes the ensemble transcript id and exon rank as well the amino acid sequence for the upstream, downstream, and exon of interest. Additionally, the table contains the output probability from the model in the last column. The input data is transformed data of the amino acid sequence that is the Exon ByPASS model can use as an input.

Files

All_Human_Exons_Skip_Probability.csv

Files (472.3 MB)

Name Size Download all
md5:8b149de57ea6984e2d74fd4c9b6b456d
178.6 MB Preview Download
md5:afe02bb333a44953205e6cb3a7bae667
122.7 MB Preview Download
md5:7df00e3d0536e11e6e35bc99a6c5e83a
56.9 MB Preview Download
md5:f1a54a37547765d30552a0d6a27362e5
41.8 MB Preview Download
md5:b2515f9e4a10aa66a9d29cd281a1c1dd
39.9 MB Preview Download
md5:8132dff5400bf1edf3251816521bac3d
32.3 MB Preview Download