Prediction of Breast Cancer Survival Through Knowledge Discovery in Databases
Open Access
- 16 December 2014
- journal article
- Published by Canadian Center of Science and Education in Global Journal of Health Science
- Vol. 7 (4), 392-398
- https://doi.org/10.5539/gjhs.v7n4p392
Abstract
The collection of large volumes of medical data has offered an opportunity to develop prediction models for survival by the medical research community. Medical researchers who seek to discover and extract hidden patterns and relationships among large number of variables use knowledge discovery in databases (KDD) to predict the outcome of a disease. The study was conducted to develop predictive models and discover relationships between certain predictor variables and survival in the context of breast cancer. This study is Cross sectional. After data preparation, data of 22,763 female patients, mean age 59.4 years, stored in the Surveillance Epidemiology and End Results (SEER) breast cancer dataset were analyzed anonymously. IBM SPSS Statistics 16, Access 2003 and Excel 2003 were used in the data preparation and IBM SPSS Modeler 14.2 was used in the model design. Support Vector Machine (SVM) model outperformed other models in the prediction of breast cancer survival. Analysis showed SVM model detected ten important predictor variables contributing mostly to prediction of breast cancer survival. Among important variables, behavior of tumor as the most important variable and stage of malignancy as the least important variable were identified. In current study, applying of the knowledge discovery method in the breast cancer dataset predicted the survival condition of breast cancer patients with high confidence and identified the most important variables participating in breast cancer survival.Keywords
This publication has 12 references indexed in Scilit:
- Survival Rate of Breast Cancer Based on Geographical Variation in Iran, a National StudyIranian Red Crescent Medical Journal, 2012
- Data Analysis and Data Mining: Current Issues in Biomedical InformaticsMethods of Information in Medicine, 2011
- Data Mining in GenomicsClinics in Laboratory Medicine, 2008
- Predicting Metastasis in Breast Cancer: Comparing a Decision Tree with Domain ExpertsJournal of Medical Systems, 2007
- Applications of Machine Learning in Cancer Prediction and PrognosisCancer Informatics, 2006
- Improvement of breast cancer relapse prediction in high risk intervals using artificial neural networksBreast Cancer Research and Treatment, 2005
- Predicting breast cancer survivability: a comparison of three data mining methodsArtificial Intelligence in Medicine, 2004
- Medical Decision Support Systems: Old Dilemmas and new Paradigms?Methods of Information in Medicine, 2003
- Uniqueness of medical data miningArtificial Intelligence in Medicine, 2002
- Artificial neural networks improve the accuracy of cancer survival predictionCancer, 1997