The Use of Propensity Scores to Assess the Generalizability of Results from Randomized Trials

Abstract

Summary: Randomized trials remain the most accepted design for estimating the effects of interventions, but they do not necessarily answer a question of primary interest: will the programme be effective in a target population in which it may be implemented? In other words, are the results generalizable? There has been very little statistical research on how to assess the generalizability, or ‘external validity’, of randomized trials. We propose the use of propensity-score-based metrics to quantify the similarity of the participants in a randomized trial and a target population. In this setting the propensity score model predicts participation in the randomized trial, given a set of covariates. The resulting propensity scores are used first to quantify the difference between the trial participants and the target population, and then to match, subclassify or weight the control group outcomes to the population, assessing how well the propensity-score-adjusted outcomes track the outcomes that are actually observed in the population. These metrics can serve as a first step in assessing the generalizability of results from randomized trials to target populations. The paper lays out these ideas, discusses the assumptions underlying the approach and illustrates the metrics by using data on the evaluation of a schoolwide prevention programme called ‘Positive behavioral interventions and supports’.

Keywords

Funding Information

National Institute of Mental Health (K25 MH083846, 1 R01 MH67948-1A1)
Centers for Disease Control (R49/CCR318627)
Institute of Education Sciences (R305A090307)

This publication has 56 references indexed in Scilit:

Generalizing Evidence From Randomized Clinical Trials to Target Populations: The ACTG 320 Trial
American Journal of Epidemiology, 2010
Selection criteria and generalizability within the counterfactual framework: explaining the paradox of antidepressant-induced suicidality?
Clinical Trials, 2009
Adjustment for Selection Bias in Observational Studies with Application to the Analysis of Autopsy Data
Neuroepidemiology, 2009
Evaluating bias correction in weighted proportional hazards regression
Lifetime Data Analysis, 2008
Constructing Inverse Probability Weights for Marginal Structural Models
American Journal of Epidemiology, 2008
Methods for testing theory and evaluating impact in randomized field trials: Intent-to-treat analyses for integrating the perspectives of person, place, and time
Drug and Alcohol Dependence, 2008
Generalizing from clinical trial data: A case study. The risk of suicidality among pediatric antidepressant users
Statistics in Medicine, 2008
Recent developments in meta‐analysis
Statistics in Medicine, 2007
Variable Selection for Propensity Score Models
American Journal of Epidemiology, 2006
The central role of the propensity score in observational studies for causal effects
Biometrika, 1983

Cited by 328 articles