Development and validation of machine learning-based risk prediction models of oral squamous cell carcinoma using salivary autoantibody biomarkers

Open Access

24 November 2022

journal article
research article
Published by Springer Science and Business Media LLC in BMC Oral Health

Vol. 22 (1), 1-10
https://doi.org/10.1186/s12903-022-02607-2

Abstract

The incidence of oral cavity squamous cell carcinoma (OSCC) continues to rise. OSCC is associated with a low average survival rate, and most patients have a poor disease prognosis because of delayed diagnosis. We used machine learning techniques to predict high-risk cases of OSCC by using salivary autoantibody levels and demographic and behavioral data. We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times. A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 ± 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 ± 0.06 to 0.795 ± 0.055, p < 0.001). We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.

Keywords

Funding Information

Ministry of Science and Technology, Taiwan (111-2636-E-A49-014, 108-2320-B-182-030-MY3)
Chang Gung Memorial Hospital (BMRPC77)

This publication has 59 references indexed in Scilit:

Expression of carbonic anhydrases I/II and the correlation to clinical aspects of oral squamous cell carcinoma analyzed using tissue microarray
Journal of Oral Pathology & Medicine, 2012
Tumor and Salivary Matrix Metalloproteinase Levels Are Strong Diagnostic Markers of Oral Squamous Cell Carcinoma
Cancer Epidemiology, Biomarkers & Prevention, 2011
Squamous cell carcinoma and precursor lesions of the oral cavity: epidemiology and aetiology
Periodontology 2000, 2011
Protein Microarray Signature of Autoantibody Biomarkers for the Early Detection of Breast Cancer
Journal of Proteome Research, 2010
Global epidemiology of oral and oropharyngeal cancer
Oral Oncology, 2009
Potentially malignant disorders of the oral and oropharyngeal mucosa; terminology, classification and present concepts of management
Oral Oncology, 2009
Head and neck cancer in the betel quid chewing area: recent advances in molecular carcinogenesis
Cancer Science, 2008
Does Pretreatment Seropositivity to Human Papillomavirus Have Prognostic Significance for Head and Neck Cancers?
Cancer Epidemiology, Biomarkers & Prevention, 2008
Univariate and multivariate analysis of prognostic significance of betel quid chewing in squamous cell carcinoma of buccal mucosa in Taiwan
Journal of Surgical Oncology, 2005
The Origins of Logistic Regression
SSRN Electronic Journal, 2003

Cited by 3 articles