A large peptidome dataset improves HLA class I epitope prediction across most of the human population
- 1 February 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Biotechnology
- Vol. 38 (2), 199-+
- https://doi.org/10.1038/s41587-019-0322-9
Abstract
Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.
Funding Information
- U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (T32HG002295)