TEST DATA for Enhanced protein isoform characterization through long-read proteogenomics
Creators
- 1. University of Wisconsin
- 2. University of Virginia
- 3. Lifebit Biotech Ltd.
- 4. Science and Technology Consulting LLC
- 5. University of Zurich
Description
Test data for the following work. The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio, Oxford Nanopore) provides full-length transcript sequencing, which can be used to predict full-length proteins. Here, we describe a long-read proteogenomics approach for integrating matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data in protein inference to enable detection of protein isoforms that are intractable to MS detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis.
Files
(604.2 MB)
MD5 checksum | Size |
---|---|
md5:ab070e9a3892d333455959bc0b7cbe26 | 29.1 MB |
md5:84816aec673769a615233dca8270ed95 | 4.9 MB |
md5:b0f5cdd4f75186f8a4d2e23378c57b5b | 50.8 MB |
md5:06386647ccb0e9942208a659ca761ee1 | 125.0 kB |
md5:d6bfd335a049ce7173ba7366dc0d48bc | 3.1 MB |
md5:934a2d0c157d9060bad61ade0f4def89 | 56.0 MB |
md5:0f3a0a1525ece57a15ad053674c88c1f | 362.6 kB |
md5:0b6ec27c462889cb8854e129c3420441 | 1.4 MB |
md5:9f21dd434a148ae3ddb9fc16d124e098 | 427.5 MB |
md5:1ef7d3d031b223776fca759f1e16df2e | 70 Bytes |
md5:23b3576e87c3f849a9e767dab2b71f24 | 855.6 kB |
md5:b936a6638601e61c46a12fde00181c30 | 855.6 kB |
md5:c89521f295edb10316b62aa89cbf9210 | 29.1 MB |
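The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal sketch in Python using only the standard library; the function names `md5_of_file` and `verify_download` are illustrative and not part of the dataset or the pipeline, and because the file names are not shown in this listing, any path passed in is a placeholder.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that large files (e.g. the 427.5 MB entry) do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Return True if the file's MD5 matches the expected checksum.

    The expected value may carry the 'md5:' prefix used in the listing.
    """
    expected = expected_md5.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `verify_download("downloaded_file", "md5:ab070e9a3892d333455959bc0b7cbe26")` would return `True` only if the local file (here a hypothetical path) matches the first checksum in the listing.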