Penny-wise and pound-foolish: the impact of measurement error on sample size requirements in clinical trials

1 April 2000

journal article
research article
Published by Elsevier BV in Biological Psychiatry

Vol. 47 (8), 762-766
https://doi.org/10.1016/s0006-3223(00)00837-4

Abstract

Background: Clinical research studies must compensate for measurement error by increasing the number of subjects that are studied, thereby increasing the financial costs of research and exposing greater numbers of subjects to study risks. In this article, we model the relationship between reliability and sample-size requirements and consider the potential tangible cost savings resulting from the decreased number of subjects needed when reliability of raters is improved or multiple ratings are used. Methods: Standard methods are used to model reliability based on the intraclass correlation coefficient ( R) and to perform power calculations. The impact of multiple raters on reliability for a given baseline level of reliability is modeled according to the Spearman Brown formula. Results: Our models demonstrate that meaningful reductions in sample size requirements are gained from improvements in reliability. For example, improving reliability from R = .7 to R = .9 will decreases sample size requirements by 22%. Reliability is improved by training and by the use of the mean of multiple ratings. For example, if the reliability of a single rating is 0.7, the reliability of the mean of two ratings will be 0.8. Conclusions: The costs to improve reliability either through rater training efforts or use of the mean of multiple ratings is cost effective because of the consequent reduction in number of subjects needed. Efforts to improve reliability and thus reduce subject requirements in a study also may lead to fewer patients bearing the burden of research participation and to a shortening of the duration of studies.

Keywords

This publication has 3 references indexed in Scilit:

Improvement of inter‐rater reliability of PANSS items and subscales by a standardized rater training
Acta Psychiatrica Scandinavica, 1998
More Reliable Outcome Measures Can Reduce Sample Size Requirements
Archives of General Psychiatry, 1995
ON THE METHODS AND THEORY OF RELIABILITY
The Journal of Nervous and Mental Disease, 1976

Cited by 87 articles