Test-time training for deep MS/MS spectrum prediction improves peptide identification
Authors/Creators
- 1. University of Science and Technology of China
- 2. National Institute of Biological Sciences,
- 3. University of California, San Diego
Description
In bottom-up proteomics, peptide-spectrum matching is critical for peptide and protein identification. Recently, deep learning models have been used to predict tandem mass spectra of peptides, with the similarity scores of predicted and experimental spectra being integrated into peptide-spectrum matching. These models follow the supervised learning paradigm, which trains a general model using paired peptides and spectra from standard datasets and uses the model for prediction on experimental data. However, this approach can lead to inaccurate predictions due to differences between the training data and the experimental data, such as sample types, enzyme specificity, and instrument calibration. To address this issue, we proposed a Test-Time Training paradigm that adapts the pre-trained model to experimental data-specific models, namely PepT3. PepT3 results in a 10-40\% increase in peptide identification, depending on the distinctness of training and experimental data. Intriguingly, PepT3 improves the identification of tumor-specific neo-epitopes when applied to a complex patient-derived immunopeptidomic sample, with two-thirds of these neo-epitopes predicted to bind to the patient's human leukocyte antigen isoforms
Files
figure23-bekker.zip
Files
(34.0 GB)
| Name | Size | |
|---|---|---|
|
md5:0c74fa8daf51bf56be82251e4dfc9524
|
14.0 GB | Preview Download |
|
md5:4dce7ec5fa0df653d98d2f10fcffd69f
|
1.6 GB | Preview Download |
|
md5:a247cebfa664863e65d6cae9ea4d763c
|
5.5 GB | Preview Download |
|
md5:4e8b460c20efb78a83023548fca88c62
|
2.1 GB | Preview Download |
|
md5:e8fba6dda1db1afe6da451cfb47b95e0
|
979.1 MB | Preview Download |
|
md5:08e251ffb87aae5743549f312b60496f
|
9.8 GB | Preview Download |
|
md5:131f081307644cdd611324f43c84f79b
|
2.2 kB | Preview Download |