Oblique rotation of factors: a novel pattern recognition strategy to classify fluorescence excitation–emission matrices of human blood plasma for early diagnosis of colorectal cancer
- 30 March 2016
- journal article
- research article
- Published by Royal Society of Chemistry (RSC) in Molecular BioSystems
- Vol. 12 (6), 1963-1975
- https://doi.org/10.1039/c6mb00162a
Abstract
Colorectal cancer (CRC) ranks high in both men and women, accounting for about 13% of all cancers. In this study, a novel pattern recognition strategy is proposed to improve early diagnosis of CRC through visualizing the relationship between different spectral patterns in a case-control research. Partial least squares-discriminant analysis (PLS-DA) and supervised Kohonen network (SKN) were used to classify the fluorescence excitation–emission matrices (EEMs) from 289 human blood plasma samples containing CRC patients, adenomas tumor, other non-malignant findings and healthy individuals. To obtain optimal factors, oblique rotation (OR) and genetic algorithm (GA) were used to rotate the factors by optimizing transformation matrix elements. Transformed factors were introduced to SKN to build a classification model and the model performance was examined via comparison with a common classifier; PLS-DA. Classification models were built for CRC-healthy and adenomas-healthy samples and the best results were obtained through applying GA–OR on PLS factors and introducing them to the classifiers. Non-error rates for SKN and PLS-DA models assisted with GA (for selecting more informative PLS factors) and OR were equal to 0.97 and 0.95 in cross validation and 0.93 and 0.90 for prediction of the external test set, respectively. Moreover, according to the acceptable results for adenomas-healthy cases using optimal factors, CRC can be diagnosed in early stages. Combining classifiers and optimal factors proved to be efficient for distinguishing healthy and malignant samples, and OR can significantly improve performance of the classification model.Keywords
This publication has 69 references indexed in Scilit:
- Application of comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry method to identify potential biomarkers of perinatal asphyxia in a non-human primate modelJournal of Chromatography A, 2011
- Principal component directed partial least squares analysis for combining nuclear magnetic resonance and mass spectrometry data in metabolomics: Application to the detection of breast cancerAnalytica Chimica Acta, 2011
- Organization of GC/MS and LC/MS metabolomics data into chemical librariesJournal of Cheminformatics, 2010
- Chemometrics in metabolomics—A review in human disease diagnosisAnalytica Chimica Acta, 2010
- A network-QSAR model for prediction of genetic-component biomarkers in human colorectal cancerJournal of Theoretical Biology, 2009
- Multivariate paired data analysis: multilevel PLSDA versus OPLSDAMetabolomics, 2009
- Mass-spectrometry-based metabolomics: limitations and recommendations for future progress with particular focus on nutrition researchMetabolomics, 2009
- Comprehensive two-dimensional gas chromatography/time-of-flight mass spectrometry for metabonomics: Biomarker discovery for diabetes mellitusAnalytica Chimica Acta, 2009
- Analytical strategies for LC–MS-based targeted metabolomicsJournal of Chromatography B, 2008
- Identification of serum biomarkers for colon cancer by proteomic analysisBritish Journal of Cancer, 2006