Assessing risk prediction models in case–control studies using semiparametric and nonparametric methods

29 March 2010

journal article
research article
Published by Wiley in Statistics in Medicine

Vol. 29 (13), 1391-1410
https://doi.org/10.1002/sim.3876

Abstract

The predictiveness curve is a graphical tool that characterizes the population distribution of Risk(Y)=P(D=1|Y), where D denotes a binary outcome such as occurrence of an event within a specified time period and Y denotes predictors. A wider distribution of Risk(Y) indicates better performance of a risk model in the sense that making treatment recommendations is easier for more subjects. Decisions are more straightforward when a subject's risk is deemed to be high or low. Methods have been developed to estimate predictiveness curves from cohort studies. However, early phase studies to evaluate novel risk prediction markers typically employ case–control designs. Here, we present semiparametric and nonparametric methods for evaluating a continuous risk prediction marker that accommodate case–control data. Small sample properties are investigated through simulation studies. The semiparametric methods are substantially more efficient than their nonparametric counterparts under a correctly specified model. We generalize them to settings where multiple prediction markers are involved. Applications to prostate cancer risk prediction markers illustrate methods for comparing the risk prediction capacities of markers and for evaluating the increment in performance gained by adding a marker to a baseline risk model. We propose a modified Hosmer–Lemeshow test for case–control study data to assess calibration of the risk model that is a natural complement to this graphical tool. Copyright © 2010 John Wiley & Sons, Ltd.
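To make the idea of a predictiveness curve from case–control data concrete, the following is a minimal sketch and not the estimators proposed in the article. It assumes a single continuous marker, a logistic risk model, and a disease prevalence known from an external source; the simulated data, the function names (`fit_logistic`, `population_risk`, `predictiveness_curve`), and all parameter values are hypothetical. The logistic intercept is shifted to undo the case–control sampling, and cases and controls are reweighted to population proportions before taking quantiles of Risk(Y).

```python
import numpy as np

def fit_logistic(y, d, n_iter=25):
    """Newton-Raphson fit of logit P(D=1|Y) = b0 + b1*Y on the sampled data."""
    X = np.column_stack([np.ones_like(y), y])
    beta = np.zeros(2)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (d - p)
        hess = (X * (p * (1.0 - p))[:, None]).T @ X
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def population_risk(y, d, prevalence):
    """Risk(Y) on the population scale, plus population weights per subject."""
    beta = fit_logistic(y, d)
    n_case, n_ctrl = d.sum(), (1.0 - d).sum()
    # Case-control sampling distorts only the intercept of a logistic model;
    # remove the sampling odds and put back the (assumed known) population odds.
    shift = np.log(n_case / n_ctrl) - np.log(prevalence / (1.0 - prevalence))
    risk = 1.0 / (1.0 + np.exp(-(beta[0] - shift + beta[1] * y)))
    # Weights so that cases carry total mass `prevalence`, controls the rest.
    w = np.where(d == 1, prevalence / n_case, (1.0 - prevalence) / n_ctrl)
    return risk, w

def predictiveness_curve(risk, w, v_grid):
    """R(v): the v-th quantile of Risk(Y) under the reweighted distribution."""
    order = np.argsort(risk)
    cum = np.cumsum(w[order])
    idx = np.clip(np.searchsorted(cum, v_grid, side="left"), 0, len(risk) - 1)
    return risk[order][idx]

# Hypothetical simulated case-control sample: marker is N(0,1) in controls
# and N(1,1) in cases, which implies a logistic risk model in the population.
rng = np.random.default_rng(0)
prevalence = 0.10                      # assumed known, not estimable here
y = np.concatenate([rng.normal(1.0, 1.0, 500), rng.normal(0.0, 1.0, 500)])
d = np.concatenate([np.ones(500), np.zeros(500)])

risk, w = population_risk(y, d, prevalence)
v_grid = np.linspace(0.01, 0.99, 99)
curve = predictiveness_curve(risk, w, v_grid)
```

Plotting `curve` against `v_grid` gives the predictiveness curve; a steeper curve corresponds to a wider distribution of Risk(Y). The prevalence must be supplied externally because a case–control sample fixes the case fraction by design.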


Cited by 38 articles