Research Article | Open Access
Chao Chen, Songming Liu, Ruokai Qu, Bo Li, "Recurrent Neoantigens in Colorectal Cancer as Potential Immunotherapy Targets", BioMed Research International, vol. 2020, Article ID 2861240, 8 pages, 2020. https://doi.org/10.1155/2020/2861240
Recurrent Neoantigens in Colorectal Cancer as Potential Immunotherapy Targets
This study was aimed at investigating the mutations in colorectal cancer (CRC) for recurrent neoantigen identification. A total of 1779 samples with whole exome sequencing (WES) data were obtained from 7 published CRC cohorts. Common HLA genotypes were used to predict the probability of neoantigens at high-frequency mutants in the dataset. Based on the WES data, we not only obtained the most comprehensive CRC mutation landscape so far but also found 1550 mutations which could be identified in at least 5 patients, including KRAS G12D (8%), KRAS G12V (5.8%), PIK3CA E545K (3.5%), PIK3CA H1047R (2.5%), and BMPR2 N583Tfs44 (2.8%). These mutations can also be recognized by multiple common HLA molecules in Chinese and TCGA cohort as potential “public” neoantigens. Many of these mutations also have high mutation rates in metastatic pan-cancers, suggesting their value as therapeutic targets in different cancer types. Overall, our analysis provides recurrent neoantigens as potential cancer immunotherapy targets.
Colorectal cancer (CRC) is the third most common malignancy in the world and the second leading cause of cancer-related mortality [1, 2]. Traditional treatments, such as surgery, chemotherapy, and radiation, have been important in prolonging patients’ survival, but for patients with advanced CRC, especially those with metastatic disease, these treatments are limited and often intolerant .
In recent years, immunotherapy, including immune checkpoint inhibitors (ICIs), cancer vaccines, and neoantigen-based tumor-infiltrating lymphocytes (TILs), has played an increasingly important role in cancer therapy . Certain CRC patients with high microsatellite instability (MSI-H) could potentially benefit from ICIs treatment . However, not all CRC patients with MSI-H show clinical efficacy in ICI treatment. Neoantigen-based immunotherapy is complementary to ICIs since it has no specific requirement for patient’s MSI status nor tumor mutation burden (TMB) [6, 7]. Previous studies on CRC genomics mainly focused on the mechanism of tumor development and metastasis and less involves neoantigen and neoantigen-based immunotherapy [8–13]. By integrating the mutation data of already existing CRC cohorts and combining the common HLA genotypes in these populations [14, 15], our study is expected to find the common neoantigens in CRC patients and facilitate further development of off-the-shelf neoantigen-based immunotherapy.
2. Materials and Methods
2.1. Genomic Data of CRC
This study was approved by the Institutional Review Board on Bioethics and Biosafety of BGI group. All somatic mutations, including single-nucleotide variants (SNVs) and short insertion/deletion (indels), were downloaded from the latest publications (Table 1 and Supplementary Table S1), which represent seven geographically diverse study groups involving 1779 CRC patients. Since all data used in this study were from public databases with informed consent from participants in the original genome study, no additional informed consent was required.
2.2. Pipeline for Neoantigen Prediction
For neoantigen prediction, a total of 43 HLA genotypes were selected with frequencies greater than 5% in the Chinese or TCGA cohort. Mutations present in at least 5 patients, including 476 SNVs and 974 indels, were selected for subsequent neoantigen analysis. A total of five software are used for affinity prediction between neoantigen peptide and HLA alleles, which are NetMHC , NetMHCpan , PickPocket , PSSMHCpan , and SMM . Candidate high-affinity peptides were further predicted by EPIC . EPIC is a neoantigen prediction software based on mass spectrometry-derived motifs and tissue-specific expression profiles. It considers various complex factors in antigen presentation process, such as affinity and tumour-specific gene expression, and can accurately predict epitope presentation. If the sample is not quantified for gene expression, the software’s default expression value () is used. According to our previous research , neoantigen peptides need to meet the following four criteria: (1) between 8 and 11 meters in length; (2) affinity in at least two tools; (3) mutant (MT) peptides affinity score lower than the wild type (WT); and (4) the presentation score of .
2.3. Statistical Analysis
The statistical analysis was done in R-studio and the mutation analysis and drawing were done with the maftools package . If no special instructions were given, was considered significant.
3.1. The Integrated Mutation Landscape of CRC Patients
The mutation profiles of all CRC samples are shown in Figure 1 and Figure S1. In general, missense mutation is the main type of mutations. At the base substitution level, C>T is the dominant mutant type, followed by C>A (Figure S2), which is consistent with TCGA and previous reports [24, 25]. The median number of mutations in each sample was 110, and APC, MUC16, TP53, SYNE1, KRAS, and PIK3CA were the most frequently mutated genes. Apart from the samples without MSI status, there are 203 MSI-H samples in the combined cohort (Table 1), accounting for 11.4%. The mutation load of MSI-H samples was higher than that of MSS samples (Figure S3-S4). Interestingly, through the integration analysis of the MSS samples, we found the mutation of four genes, including TENM1, SOX9, PIK3CA, and KRAS, and the mutation of TP53 gene were mutually exclusive (Fisher’s exact test, , Figure S5). This different mutation pattern may suggest that the carcinogenic mechanisms are different in CRC patients carrying mutations in these four genes and in those carrying TP53 mutation. Correspondingly, there is no such mutual exclusion effect in the MSI-H samples (Figure S6).
There are many hotspot mutations in CRC samples. Of these, 1550 recurrent mutations could be identified in at least 5 patients, with 476 SNVs and 974 indels. Previous studies have shown that common neoantigens in cancers could be used as potential immunotherapy targets [22, 26]. Therefore, in order to find out whether there are common neoantigens in CRC populations, we used these mutations in downstream analyses to predict tumour-specific neoantigens.
3.2. Neoantigens Shared among CRC Patients
Due to the difference in the frequency of HLA in different populations, in order to search for “public” neoantigens in CRC populations, we selected high-frequency HLA in Chinese ( in Han Chinese ) and high-frequency HLA in TCGA ( in TCGA ) for neoantigen analysis. Finally, a total of 43 HLA alleles were used for neoantigen prediction (Table S2).
We detected 274 SNV-derived neoantigens and 1269 indel-derived neoantigens (Table S3-4). Each SNV usually produces 1-2 high-affinity peptides, while each indel can produce multiple high-affinity peptides. The top ten high-frequency SNV and indel-related neoantigens are shown in Tables 2 and 3, respectively. In terms of SNV, mutations of KRAS, PIK3CA, PCBP1, and CHEK2 can produce 10 neoantigens with the highest frequency (Table 2 and Table S3). In terms of indel, although the mutation frequency is not as high as SNV’s, generally one indel can produce about 5-10 neoantigen peptides (Table 3 and Table S4).
3.3. Comparison of Neoantigens in Different Subtypes of CRC
By comparing the neoantigen profiles of different subtypes of CRC, we found that there were more SNV- and indel-derived neoantigens in MSI-H patients than in MSS patients (Fisher’s exact test, , Figure 2(a)). The 1269 indel-related neoantigens can cover 86.7% of patients with MSI-H CRC, but only 3.9% of MSS patients were covered, indicating that indel is the main source neoantigens of MSI-H CRC. SNV-derived neoantigens can cover 41% of MSS patients and 66% of MSI-H patients.
Patients older than 60 carry more neoantigens than those under 60 years of age (Fisher’s exact test, , Figure 2(b)). Women tend to carry more neoantigens than men, especially those derived from indels. Female patients accounted for 62.2% in the MSI-H cohort and 51.2% in the MSS cohort. The proportion of MSI-H subtypes was higher in female CRC patients (Fisher’s exact test, value = 0.007); this may partly explain why the results show that women have a higher neoantigen load than men. In terms of cancer stage, Stage II CRC patients carry the most abundant neoantigens, which may be related to the mutation load of the corresponding subgroup (Figures 2(c) and 2(d)). We compared the percentage of patients with MSI-H in Stage II and other stages, finding 28% of patients are MSI-H in Stage II while 10.1% in other stages (Fisher’s exact test, value < 0.001). We also analyzed the mutated rates of KRAS and PIK3CA in Stage II CRC patients. For KRAS, the mutation rate was 28.8% in Stage II while 28.4% in other stages, without significant difference. For PIK3CA, the mutation rate of PIK3CA in Stage II patients was higher than that in other stages, with the mutation rate of 25.6% and 17.1%, respectively (Fisher’s exact test, value < 0.001). According to the above analysis results, the higher neoantigen load of patients at Stage II may be due to the higher proportion of MSI-H or the higher mutation rate of PIK3CA in these patients.
3.4. Hotspot Mutation-Related Neoantigens That May Be a Potential Source of Immunotherapy Target in CRC and Pan-Cancer
To further investigate the potential significance of these high-frequency neoantigens, we focused on mutations with the highest frequency, including KRAS G12D, KRAS G12V, PIK3CA E545K, and PIK3CA H1047R, because these mutations not only produce recurrent neoantigens but also have a higher frequency in the CRC cohort (Figures 3(a) and 3(b)).
KRAS Gly12 (including G12V, G12C, and G12D) is a classic driver mutation that occurs more than 20% in the metastatic pancreatic and appendiceal cancers . Both G12D and G12V are highly mutated in multiple metastatic cancers, including endometrial, CRC, and non-small cell lung cancer (Figures 4(a) and 4(b)). Mayakonda et al. have reported the high frequency of KRAS G12D mutations in pancreatic cancer [23, 25]. Liang et al. also demonstrated by mass spectrometry in 2019 that KRAS G12V-mutated neoantigen can be presented by HLA-A11:01 cell lines . And as far as we know, clinical trials for KRAS G12V mutations in patients with HLA-A11:01 are already under way (NCT03190941).
PIK3CA is one of the driver genes in gastrointestinal malignancies [29, 30]. PIK3CA E545K is a hotspot mutation in breast cancer and has corresponding first-line drugs (Alpelisib and Fulvestrant) . In addition to breast cancer, this mutation could be found in more than 5% bladder, head and neck, and colorectal cancer patients (Figure 4(c)). PIK3CA H1047R is most commonly found in breast cancer  and is also frequently mutated among multiple tumor types in the MSK-IMPACT metastatic cancers (Figure 4(d)). Our previous work has shown that the epitopes of this mutation can be presented by multiple HLA molecules (e.g., HLA-C07:02, HLA-C 07:01, HLA-A30:01, and HLA-B58:01) and can be a potential neoantigen in patients with gastric cancer . Combined with the results of this study, it is suggested that this mutation can be used as an important therapeutic target for patients with gastrointestinal tumors.
There are many different types of antigens that can be used as targets for immunotherapy, such as tumor-associated antigens, cancer-testis antigens, and neoantigens . Tumor-associated antigens (e.g., ERBB2/HER2) are highly expressed in tumor and poorly expressed in normal tissues, so they may serve as therapeutic targets for some tumor types, but the disadvantage is that they may cause nonspecific immune responses. Cancer-testis antigens (e.g., NY-ESO-1 antigen  and MAGE ) are not expressed in normal adult cells except in reproductive tissues (e.g., testis, fetal ovaries, and trophoblast cells). However, both tumor-associated antigens and cancer-testis antigens are prone to severe immune responses. For example, attempts to target melanoma-associated antigen 3 (MAGE-A3) with adoptive cell therapy have resulted in severe neurotoxicity and death, which may be related to the expression in the brain of MAGE-A family members that has not been previously recognized . Compared with the above two antigen types, neoantigen has stronger immunogenicity and tumor specificity, so it is an ideal target for immunotherapy [37, 38]. However, due to the difference of neoantigen among patients, the current neoantigen-based immunotherapy is completely individualized. Since the human exome region is about 30 M, the probability of the same mutation between different individuals is relatively low, and the epitope of the mutation is presented by specific HLA allele, so the possibility of the same neoantigen epitope between different individuals is very low.
However, not all mutations in tumors are random. Current genomic studies have shown that there are many hotspot mutations in driver genes, and the neoantigen epitopes formed by these mutations are potential “public” immunotherapy targets, such as KRAS G12D/V and CDK4 R24C/L . In CRC, the frequency of frameshift mutations is higher in patients with MSI-H subtype, and it has been reported that frameshift peptides have been used in the clinical treatment of CRC (NCT01461148). In our study, we also found that indel-related neoantigens can cover a large proportion of MSI-H CRC patients. At present, 10-20 neoantigen peptides are usually synthesized at the same time to prepare vaccines or other immunotherapy products [40, 41]. If combined with the neoantigens corresponding to hotspot point mutations and frameshift mutations, it is believed that these neoantigen can cover more CRC patients, so that more patients can benefit from neoantigen-based immunotherapy.
Based on the analysis of the published WES data of CRC, the most complete mutation landscape of CRC was obtained. We selected HLA subtypes with high frequency in Chinese and TCGA cohort to predict the common neoantigens in the population. The high-frequency mutations, including KRAS G12D (8%), KRAS G12V (5.8%), PIK3CA E545K (3.5%), PIK3CA H1047R (2.5%), and BMPR2 N583Tfs44 (2.8%), can be recognized and presented by many HLA genotypes, such as HLA-A1101, HLA-A03:01, and HLA-B57:01. These HLA genotypes are the main HLA subtypes in Chinese and Americans, indicating the broad spectrum of the neoantigens we identified. In conclusion, we have found a series of “public” neoantigens for CRC, which provide important resources for immunotherapy of CRC in the future.
Data supporting the results of this study are available from corresponding authors on request.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
We thank professor Xiuqing Zhang for her guidance and help in this project. This work is supported by the National Natural Science Foundation of China (No. 81702826), the Shenzhen Municipal Government of China (No. 20170731162715261), and the Science, Technology and Innovation Commission of Shenzhen Municipality (grant No. JCYJ20170817145845968).
Figure S1: the landscape of 25 significantly mutated genes in all the CRC patients. Figure S2: ratio of base conversion and transversion in mutations. Figure S3: the landscape of 25 significantly mutated genes in MSI-H CRC patients. Figure S4: the landscape of 25 significantly mutated genes in MSS CRC patients. Figure S5: somatic interaction analysis among the significantly mutated genes found four genes mutually exclusive with TP53 in MSS samples. Figure S6: somatic interaction analysis in MSI-H samples. Table S1: the list of clinical information of samples in previous seven studies of CRC. Table S2: the list of high-frequency HLA genotypes in Chinese and TCGA. Table S3: the list of SNV derived neoantigens. Table S4: the list of indel-derived neoantigens. (Supplementary Materials)
- F. Bray, J. Ferlay, I. Soerjomataram, R. L. Siegel, L. A. Torre, and A. Jemal, “Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” CA: a Cancer Journal for Clinicians, vol. 68, no. 6, pp. 394–424, 2018.
- M. Arnold, M. S. Sierra, M. Laversanne, I. Soerjomataram, A. Jemal, and F. Bray, “Global patterns and trends in colorectal cancer incidence and mortality,” Gut, vol. 66, no. 4, pp. 683–691, 2017.
- Z. Hu, J. Ding, Z. Ma et al., “Quantitative evidence for early metastatic seeding in colorectal cancer,” Nature Genetics, vol. 51, no. 7, pp. 1113–1122, 2019.
- C. A. Klebanoff, S. A. Rosenberg, and N. P. Restifo, “Prospects for gene-engineered T cell immunotherapy for solid cancers,” Nature Medicine, vol. 22, no. 1, pp. 26–36, 2016.
- J. C. Dudley, M. T. Lin, D. T. Le, and J. R. Eshleman, “Microsatellite instability as a biomarker for PD-1 blockade,” Clinical Cancer Research : an official journal of the American Association for Cancer Research, vol. 22, no. 4, pp. 813–820, 2016.
- P. Malekzadeh, A. Pasetto, P. F. Robbins et al., “Neoantigen screening identifies broad TP53 mutant immunogenicity in patients with epithelial cancers,” Journal of Clinical Investigation, vol. 129, no. 3, pp. 1109–1114, 2019.
- L. Ren, M. Leisegang, B. Deng et al., “Identification of neoantigen-specific T cells and their targets: implications for immunotherapy of head and neck squamous cell carcinoma,” Oncoimmunology, vol. 8, no. 4, article e1568813, 2019.
- “Comprehensive molecular characterization of human colon and rectal cancer,” Nature, vol. 487, no. 7407, pp. 330–337, 2012.
- M. Giannakis, X. J. Mu, S. A. Shukla et al., “Genomic correlates of immune-cell infiltrates in colorectal carcinoma,” Cell Reports, vol. 15, no. 4, pp. 857–865, 2016.
- K. Guda, M. L. Veigl, V. Varadan et al., “Novel recurrently mutated genes in African American colon cancers,” Proceedings of the National Academy of Sciences of the United States of America, vol. 112, no. 4, pp. 1149–1154, 2015.
- S. Seshagiri, E. W. Stawiski, S. Durinck et al., “Recurrent R-spondin fusions in colon cancer,” Nature, vol. 488, no. 7413, pp. 660–664, 2012.
- S. Vasaikar, C. Huang, X. Wang et al., “Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities,” Cell, vol. 177, no. 4, pp. 1035–1049.e19, 2019, e1019.
- Z. Liu, C. Yang, X. Li et al., “The landscape of somatic mutation in sporadic Chinese colorectal cancer,” Oncotarget, vol. 9, no. 44, pp. 27412–27422, 2018.
- E. J. M. dos Santos, A. McCabe, F. F. Gonzalez-Galarza, A. R. Jones, and D. Middleton, “Allele frequencies net database: improvements for storage of individual genotypes and analysis of existing data,” Human Immunology, vol. 77, no. 3, pp. 238–248, 2016.
- F. Zhou, H. Cao, X. Zuo et al., “Deep sequencing of the MHC region in the Chinese population contributes to studies of complex disease,” Nature Genetics, vol. 48, no. 7, pp. 740–746, 2016.
- C. Lundegaard, K. Lamberth, M. Harndahl, S. Buus, O. Lund, and M. Nielsen, “NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8-11,” Nucleic Acids Research, vol. 36, suppl_2, pp. W509–W512, 2008.
- I. Hoof, B. Peters, J. Sidney et al., “NetMHCpan, a method for MHC class I binding prediction beyond humans,” Immunogenetics, vol. 61, no. 1, pp. 1–13, 2009.
- H. Zhang, O. Lund, and M. Nielsen, “The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding,” Bioinformatics, vol. 25, no. 10, pp. 1293–1299, 2009.
- G. Liu, D. Li, Z. Li et al., “PSSMHCpan: a novel PSSM-based software for predicting class I peptide-HLA binding affinity,” GigaScience, vol. 6, no. 5, pp. 1–11, 2017.
- B. Peters and A. Sette, “Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method,” BMC Bioinformatics, vol. 6, no. 1, p. 132, 2005.
- W. Hu, S. Qiu, Y. Li et al., EPIC: MHC-I epitope prediction integrating mass spectrometry derived motifs and tissue-specific expression profiles, Biorxiv, 2019.
- C. Chen, Q. Zhou, R. Wu et al., “A comprehensive survey of genomic alterations in gastric cancer reveals recurrent neoantigens as potential therapeutic targets,” BioMed Research International, vol. 2019, Article ID 2183510, 10 pages, 2019.
- A. Mayakonda, D. C. Lin, Y. Assenov, C. Plass, and H. P. Koeffler, “Maftools: efficient and comprehensive analysis of somatic variants in cancer,” Genome Research, vol. 28, no. 11, pp. 1747–1756, 2018.
- K. Wang, S. T. Yuen, J. Xu et al., “Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer,” Nature Genetics, vol. 46, no. 6, pp. 573–582, 2014.
- Z. J. Zang, I. Cutcutache, S. L. Poon et al., “Exome sequencing of gastric adenocarcinoma identifies recurrent somatic mutations in cell adhesion and chromatin remodeling genes,” Nature Genetics, vol. 44, no. 5, pp. 570–574, 2012.
- J. Zhou, W. Zhao, J. Wu et al., “Neoantigens derived from recurrently mutated genes as potential immunotherapy targets for gastric cancer,” BioMed Research International, vol. 2019, Article ID 8103142, 11 pages, 2019.
- P. Charoentong, F. Finotello, M. Angelova et al., “Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade,” Cell Reports, vol. 18, no. 1, pp. 248–262, 2017.
- Z. Liang, L. Qin, L. Chen et al., The common neoantigens in colorectal cancer are predicted and validated to be presented or immunogenic, bioRxiv, 2019.
- The Cancer Genome Atlas Research Network, “Comprehensive molecular characterization of gastric adenocarcinoma,” Nature, vol. 513, no. 7517, pp. 202–209, 2014.
- C. Chen, C. Shi, X. Huang et al., “Molecular profiles and metastasis markers in Chinese patients with gastric carcinoma,” Scientific Reports, vol. 9, no. 1, p. 13995, 2019.
- D. Juric, F. Janku, J. Rodón et al., “Alpelisib plus Fulvestrant inPIK3CA-Altered andPIK3CA-Wild-Type estrogen receptor-positive advanced breast Cancer,” JAMA Oncology, vol. 5, no. 2, article e184475, 2019.
- J. A. Engelman, L. Chen, X. Tan et al., “Effective use of PI3K and MEK inhibitors to treat mutant _Kras_ G12D and _PIK3CA_ H1047R murine lung cancers,” Nature Medicine, vol. 14, no. 12, pp. 1351–1356, 2008.
- M. Yarchoan, B. A. Johnson III, E. R. Lutz, D. A. Laheru, and E. M. Jaffee, “Targeting neoantigens to augment antitumour immunity,” Nature Reviews Cancer, vol. 17, no. 4, pp. 209–222, 2017.
- S. Gnjatic, H. Nishikawa, A. A. Jungbluth et al., “NY‐ESO‐1: review of an immunogenic tumor antigen,” Advances in Cancer Research, vol. 95, pp. 1–30, 2006.
- P. Chomez, O. De Backer, M. Bertrand, E. De Plaen, T. Boon, and S. Lucas, “An overview of the MAGE gene family with the identification of all human members of the family,” Cancer Research, vol. 61, no. 14, pp. 5544–5551, 2001.
- R. A. Morgan, N. Chinnasamy, D. Abate-Daga et al., “Cancer regression and neurological toxicity following anti-MAGE-A3 TCR gene therapy,” Journal of Immunotherapy, vol. 36, no. 2, pp. 133–151, 2013.
- M. M. Richters, H. Xia, K. M. Campbell, W. E. Gillanders, O. L. Griffith, and M. Griffith, “Best practices for bioinformatic characterization of neoantigens for clinical utility,” Genome Medicine, vol. 11, no. 1, p. 56, 2019.
- P. Maby, J. Galon, and J. B. Latouche, “Frameshift mutations, neoantigens and tumor-specific CD8(+) T cells in microsatellite unstable colorectal cancers,” Oncoimmunology, vol. 5, no. 5, article e1115943, 2016.
- C. A. Klebanoff and J. D. Wolchok, “Shared cancer neoantigens: making private matters public,” Journal of Experimental Medicine, vol. 215, no. 1, pp. 5–7, 2018.
- D. B. Keskin, A. J. Anandappa, J. Sun et al., “Neoantigen vaccine generates intratumoral T cell responses in phase Ib glioblastoma trial,” Nature, vol. 565, no. 7738, pp. 234–239, 2019.
- T. N. Schumacher and R. D. Schreiber, “Neoantigens in cancer immunotherapy,” Science, vol. 348, no. 6230, pp. 69–74, 2015.
Copyright © 2020 Chao Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.