Identification of leukemia stem cell expression signatures through Monte Carlo feature selection strategy and support vector machine
- 29 May 2019
- journal article
- research article
- Published by Springer Science and Business Media LLC in Cancer Gene Therapy
- Vol. 27 (1-2), 56-69
- https://doi.org/10.1038/s41417-019-0105-y
Abstract
Acute myeloid leukemia (AML) is a type of blood cancer characterized by the rapid growth of immature white blood cells from the bone marrow. Therapy resistance resulting from the persistence of leukemia stem cells (LSCs) is found in numerous patients. Comparative transcriptome studies have previously been conducted to analyze differentially expressed genes between LSC+ and LSC− cells. However, these studies mainly focused on a limited number of genes with the most obvious expression differences between the two cell types. We developed a computational approach incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), incremental feature selection (IFS), support vector machine (SVM), and Repeated Incremental Pruning to Produce Error Reduction (RIPPER), to identify gene expression features specific to LSCs. We first identified 1159 features (genes) that can be used to build the optimal SVM classifier for distinguishing LSC+ and LSC− cells. Among these 1159 genes, the top 17 genes were identified as LSC-specific biomarkers. In addition, six classification rules were produced by the RIPPER algorithm. A subsequent literature review of these features/genes and classification rules, together with functional enrichment analyses of the 1159 features/genes, confirmed the relevance of the extracted genes and rules to the characteristics of LSCs.
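The incremental feature selection (IFS) step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature ranking below uses a simple difference of class means as a placeholder for MCFS, a nearest-centroid classifier stands in for the SVM, and the data are synthetic. All function names are hypothetical. The only idea carried over from the abstract is that features are ranked, nested subsets of top-ranked features are evaluated with a classifier, and the subset size with the best performance is kept.

```python
import random
from statistics import mean


def rank_by_mean_difference(X, y):
    """Placeholder ranking (NOT MCFS): sort features by |difference of class means|."""
    labels = sorted(set(y))

    def score(j):
        a = mean(row[j] for row, lab in zip(X, y) if lab == labels[0])
        b = mean(row[j] for row, lab in zip(X, y) if lab == labels[1])
        return abs(a - b)

    return sorted(range(len(X[0])), key=score, reverse=True)


def nearest_centroid_predict(train_X, train_y, test_X):
    """Stand-in classifier (NOT an SVM): assign each sample to the closest class centroid."""
    centroids = {}
    for label in set(train_y):
        rows = [x for x, lab in zip(train_X, train_y) if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]

    def dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))

    return [min(centroids, key=lambda c: dist(x, centroids[c])) for x in test_X]


def cv_accuracy(X, y, feature_idx, k=5):
    """k-fold cross-validated accuracy using only the columns in feature_idx."""
    sub = [[row[j] for j in feature_idx] for row in X]
    n = len(sub)
    correct = 0
    for fold in range(k):
        test_ids = [i for i in range(n) if i % k == fold]
        train_ids = [i for i in range(n) if i % k != fold]
        preds = nearest_centroid_predict(
            [sub[i] for i in train_ids],
            [y[i] for i in train_ids],
            [sub[i] for i in test_ids],
        )
        correct += sum(p == y[i] for p, i in zip(preds, test_ids))
    return correct / n


def incremental_feature_selection(X, y, ranked_features, k=5):
    """Score the nested subsets top-1, top-2, ... and return (best size, all scores).

    Ties are resolved in favour of the smallest subset.
    """
    scores = [
        cv_accuracy(X, y, ranked_features[:m], k)
        for m in range(1, len(ranked_features) + 1)
    ]
    best_size = scores.index(max(scores)) + 1
    return best_size, scores


# Synthetic toy data (an assumption for illustration): 40 samples, 6 features,
# of which only features 0 and 1 depend on the class label.
random.seed(0)
y = [i % 2 for i in range(40)]
X = [
    [3 * lab + random.gauss(0, 0.5), 2 * lab + random.gauss(0, 0.5)]
    + [random.gauss(0, 1) for _ in range(4)]
    for lab in y
]

ranked = rank_by_mean_difference(X, y)
best_size, scores = incremental_feature_selection(X, y, ranked)
print("ranking:", ranked)
print("best subset size:", best_size)
```

In a real analysis, the ranking would come from MCFS and the classifier would be an SVM evaluated under the cross-validation scheme chosen by the study; the loop over nested feature subsets would stay the same.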