HiFreSP: A novel high-frequency sub-pathway mining approach to identify robust prognostic gene signatures
- 15 July 2019
- journal article
- research article
- Published by Oxford University Press (OUP) in Briefings in Bioinformatics
- Vol. 21 (4), 1411-1424
- https://doi.org/10.1093/bib/bbz078
Abstract
With the increasing awareness of heterogeneity in cancers, better prediction of cancer prognosis is much needed for more personalized treatment. Recently, extensive efforts have been made to explore the variations in gene expression for better prognosis. However, the prognostic gene signatures predicted by most existing methods have little robustness among different datasets of the same cancer. To improve the robustness of the gene signatures, we propose a novel high-frequency sub-pathways mining approach (HiFreSP), integrating a randomization strategy with gene interaction pathways. We identified a six-gene signature (CCND1, CSF3R, E2F2, JUP, RARA and TCF7) in esophageal squamous cell carcinoma (ESCC) by HiFreSP. This signature displayed a strong ability to predict the clinical outcome of ESCC patients in two independent datasets (log-rank test, P = 0.0045 and 0.0087). To further show the predictive performance of HiFreSP, we applied it to two other cancers: pancreatic adenocarcinoma and breast cancer. The identified signatures show high predictive power in all testing datasets of the two cancers. Furthermore, compared with the two popular prognosis signature predicting methods, the least absolute shrinkage and selection operator penalized Cox proportional hazards model and the random survival forest, HiFreSP showed better predictive accuracy and generalization across all testing datasets of the above three cancers. Lastly, we applied HiFreSP to 8137 patients involving 20 cancer types in the TCGA database and found high-frequency prognosis-associated pathways in many cancers. Taken together, HiFreSP shows higher prognostic capability and greater robustness, and the identified signatures provide clinical guidance for cancer prognosis. HiFreSP is freely available via GitHub: https://github.com/chunquanlipathway/HiFreSP.Keywords
Funding Information
- the Fundamental Research Funds for the Provincial Universities
- Scientific Research Fund of Harbin Medical University-Daqing (2018XN-20, DQXN201610)
- Yu Weihan Outstanding Youth Training Fund of Harbin Medical University
- Natural Science Foundation of Heilongjiang Province (F2016031, JJ2016ZR1232)
- Natural Science Foundation of China-Guangdong Joint Fund (U1601229)
- National Natural Science Foundation of China (81572341, 61601150)
This publication has 47 references indexed in Scilit:
- Subpathway-GM: identification of metabolic subpathways via joint power of interesting genes and metabolites and their topologies within pathwaysNucleic Acids Research, 2013
- Characterizing the Network of Drugs and Their Affected Metabolic SubpathwaysPLOS ONE, 2012
- Temporal Regulation of Rapamycin on Memory CTL Programming by IL-12PLOS ONE, 2011
- The Implications of Relationships between Human Diseases and Metabolic SubpathwaysPLOS ONE, 2011
- On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional dataBioinformatics, 2010
- L1 Penalized Estimation in the Cox Proportional Hazards ModelBiometrical Journal, 2010
- SubpathwayMiner: a software package for flexible identification of pathwaysNucleic Acids Research, 2009
- Focal adhesion regulation of cell behaviorBiochimica et Biophysica Acta (BBA) - Molecular Cell Research, 2004
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Possible paracrine growth of adenocarcinoma of the stomach induced by granulocyte colony stimulating factor produced by squamous cell carcinoma of the oesophagusGut, 2000