There is a newer version of the record available.

Published April 26, 2024 | Version v2
Dataset Open

HLApollo: Towards designing improved cancer immunotherapy targets with a superior peptide-MHC-I presentation model

Description

Based on the success of cancer immunotherapy, personalized cancer vaccines have recently emerged as the vanguard of oncology treatment. Because antigen presentation on MHC class I (MHC-I) is key to the adaptive immune response to cancerous cells, it is critical to have highly predictive computational methods to model which peptides are presented on MHC-I. Here, we introduce HLApollo, a transformer-based model with end-to-end treatment of MHC-I sequence, deconvolution of multi-allelic data, and ligand-flanking sequences. We develop negative-set switching, a novel training strategy that greatly reduces overfitting and is key to HLApollo’s performance, leading to increases of 20.19% and 4.1% in average precision (AP) over the next-best model on MHC-I presentation and immunogenicity, respectively. Incorporating protein features derived from protein language models yielded further gains and reduced the need for gene expression measurements. We achieve excellent pan-allelic generalization and create a framework for estimating performance on untrained alleles. This guides the clinical use of HLApollo, where rare alleles may be observed, particularly for individuals from underrepresented ancestries. Our work uses all facets of available MHC-I data to develop a highly accurate MHC-I presentation predictor that meaningfully improves immunogenicity prediction and allelic coverage, both important for clinical applications of personalized neoantigen vaccines.
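For readers less familiar with the metric quoted above, the following is a minimal sketch of average precision (AP) under its standard ranking-based definition: the mean of the precision values at each rank where a true positive appears. The description does not state how AP was computed for HLApollo, so the function name `average_precision` and the toy labels and scores are illustrative assumptions, not the authors' evaluation code.

```python
def average_precision(labels, scores):
    """Ranking-based AP: mean precision at each rank holding a positive.

    labels: iterable of 0/1 (1 = presented peptide, hypothetical toy data)
    scores: iterable of model scores, higher = more likely presented
    Ties in score are broken by input order, which is an arbitrary choice.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / hits if hits else 0.0


# Toy example (made-up values): positives land at ranks 1 and 3.
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]
toy_ap = average_precision(toy_labels, toy_scores)  # (1/1 + 2/3) / 2
```

AP rewards placing true positives near the top of the ranking, which suits presentation prediction, where negatives vastly outnumber presented peptides.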

Files (242.0 MB)

MD5 checksum                          Size
md5:a90ddcd6ebfcecd01cd04f8a62521753  239.1 MB
md5:93fae62a1badc66086fdb51ae5e4c279  598.7 kB
md5:bb460e18fe47ad669dbc216248382ed1  634.8 kB
md5:aeabeea172550eecd034f84dac600a6c  675.4 kB
md5:e55e62eb4fc9828a9abb31df3b979efb  201.0 kB
md5:244771976b082aeb44fa24f23c9c4e1d  196.0 kB
md5:d184ddd2d29f9f9e077f2ce4246b9a2c  205.8 kB
md5:d315fbe314ed22a0c4e68e560ca01012  357.2 kB
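The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The sketch below uses only Python's standard `hashlib` module. Because the listing does not show file names, it checks whether a local file's digest matches any of the published values; the helper names `md5_of_file` and `is_listed` are hypothetical.

```python
import hashlib

# Checksums copied from the file listing; the listing gives no file names,
# so matching a local file to a specific entry is left to the reader.
EXPECTED_MD5 = {
    "a90ddcd6ebfcecd01cd04f8a62521753",
    "93fae62a1badc66086fdb51ae5e4c279",
    "bb460e18fe47ad669dbc216248382ed1",
    "aeabeea172550eecd034f84dac600a6c",
    "e55e62eb4fc9828a9abb31df3b979efb",
    "244771976b082aeb44fa24f23c9c4e1d",
    "d184ddd2d29f9f9e077f2ce4246b9a2c",
    "d315fbe314ed22a0c4e68e560ca01012",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed(path):
    """True if the file's MD5 matches one of the published checksums."""
    return md5_of_file(path) in EXPECTED_MD5
```

Calling `is_listed` on each downloaded file and finding a `False` result would suggest a truncated or corrupted download, or a file from a different version of the record.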

Additional details

Related works

Is supplemented by
Journal article: 10.1101/2022.12.08.519673 (DOI)