Published January 21, 2022
| Version 1
Dataset
Open
Predicting Exon Criticality from Protein Sequence
- 1. Wave Life Sciences
- 2. Wave Life Sciences
Description
Exon ByPASS (predicting Exon-skipping Based in Protein amino acid SequenceS) predictions on test exons from human and mouse transcripts. The test set for the two genomes consists of exons that are not predicted to be skippable in the hg38 and mm10 annotations and that remain in-frame when skipped. The preprocessed data includes the Ensembl transcript ID and exon rank, as well as the amino acid sequences of the upstream exon, the downstream exon, and the exon of interest. The last column of the table contains the output probability from the model. The input data is a transformed representation of the amino acid sequence that the Exon ByPASS model can use as input.
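The table layout described above can be read with a few lines of Python. This is only a minimal sketch: the column names and values in the sample are hypothetical placeholders, and the presence of a header row and the 0.5 cutoff are assumptions. The only detail taken from the description is that the model's output probability sits in the last column.

```python
import csv
import io


def read_skip_probabilities(handle, has_header=True):
    """Yield (other_fields, probability) for each row of the table.

    The probability is taken from the LAST column, as the description states.
    has_header is an assumption; set it to False if the file has no header row.
    """
    reader = csv.reader(handle)
    if has_header:
        next(reader, None)  # skip the header row
    for row in reader:
        if not row:
            continue  # ignore blank lines
        yield row[:-1], float(row[-1])


# Hypothetical sample: column names and values are illustrative only.
SAMPLE = """transcript_id,exon_rank,upstream_aa,exon_aa,downstream_aa,probability
ENST_EXAMPLE_1,3,MKT,LLV,GHA,0.91
ENST_EXAMPLE_2,5,AAG,PQR,STV,0.12
"""

rows = list(read_skip_probabilities(io.StringIO(SAMPLE)))

# 0.5 is an arbitrary illustrative cutoff, not one given by the dataset.
high = [fields[0] for fields, prob in rows if prob >= 0.5]
print(high)
```

To read one of the downloaded files instead of the sample, pass an open file handle, for example `open("All_Human_Exons_Skip_Probability.csv", newline="")`. Because the function yields rows one at a time, the large CSV files can be filtered without loading them fully into memory.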
Files
All_Human_Exons_Skip_Probability.csv
Files (472.3 MB)
Checksum | Size |
---|---|
md5:8b149de57ea6984e2d74fd4c9b6b456d | 178.6 MB |
md5:afe02bb333a44953205e6cb3a7bae667 | 122.7 MB |
md5:7df00e3d0536e11e6e35bc99a6c5e83a | 56.9 MB |
md5:f1a54a37547765d30552a0d6a27362e5 | 41.8 MB |
md5:b2515f9e4a10aa66a9d29cd281a1c1dd | 39.9 MB |
md5:8132dff5400bf1edf3251816521bac3d | 32.3 MB |
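The MD5 checksums in the file listing can be used to confirm that a download completed intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the local path passed to them is whatever name the file was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a local file against a listing entry such as 'md5:8b14...'."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing(local_path, "md5:8b149de57ea6984e2d74fd4c9b6b456d")` returns `True` only if the file at `local_path` is byte-identical to the 178.6 MB entry. The listing here does not show which file name belongs to which checksum, so that pairing has to be taken from the record page itself.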