Inclusion of genetic variants in an ensemble of gradient boosting decision trees does not improve the prediction of citalopram treatment response

Open Access

12 February 2021

journal article
research article
Published by Springer Science and Business Media LLC in Scientific Reports

Vol. 11 (1), 1-9
https://doi.org/10.1038/s41598-021-83338-2

Abstract

Identifying in advance who is unlikely to respond to a specific antidepressant treatment is crucial to precision medicine efforts. The current work leverages genome-wide genetic variation and machine learning to predict response to the antidepressant citalopram using data from the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) trial (n = 1257 with both valid genomic and outcome data). A confirmatory approach selected 11 SNPs previously reported to predict response to escitalopram in a sample different from the current study. A novel exploratory approach selected SNPs from across the genome using nested cross-validation with elastic net logistic regression with a predominantly lasso penalty (alpha = 0.99). SNPs from each approach were combined with baseline clinical predictors and treatment response outcomes were predicted using a stacked ensemble of gradient boosting decision trees. Using pre-treatment clinical and symptom predictors only, out-of-fold prediction of a novel treatment response definition based on STAR*D treatment guidelines was acceptable, AUC = .659, 95% CI [0.629, 0.689]. The inclusion of SNPs using confirmatory or exploratory selection methods did not improve the out-of-fold prediction of treatment response (AUCs were .662, 95% CI [0.632, 0.692] and .655, 95% CI [0.625, 0.685], respectively). A similar pattern of results were observed for the secondary outcomes of the presence or absence of distressing side effects regardless of treatment response and achieving remission or satisfactory partial response, assuming medication tolerance. In the current study, incorporating SNP variation into prognostic models did not enhance the prediction of citalopram response in the STAR*D sample.

Funding Information

National Institutes of Health

This publication has 47 references indexed in Scilit:

A machine learning approach using EEG data to predict response to SSRI treatment for major depressive disorder
Clinical Neurophysiology, 2013
The efficacy of psychotherapy and pharmacotherapy in treating depressive and anxiety disorders: a meta‐analysis of direct comparisons
World Psychiatry, 2013
Contribution of Common Genetic Variants to Antidepressant Response
Biological Psychiatry, 2013
The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
PLoS Genetics, 2010
A Genomewide Association Study of Citalopram Response in Major Depressive Disorder
Biological Psychiatry, 2010
Novel loci for major depression identified by genome-wide association study of Sequenced Treatment Alternatives to Relieve Depression and meta-analysis of three studies
Molecular Psychiatry, 2009
The FKBP5-Gene in Depression and Treatment Response—an Association Study in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) Cohort
Biological Psychiatry, 2008
Super Learner
Statistical Applications in Genetics and Molecular Biology, 2007
Variation in the Gene Encoding the Serotonin 2A Receptor Is Associated with Outcome of Antidepressant Treatment
American Journal of Human Genetics, 2006
The pharmacological effect of citalopram resides in the (S)-(+)-enantiomer
Journal of Neural Transmission, 1992

Cited by 6 articles