Statistical Methodology for the Concurrent Assessment of Interrater and Intrarater Reliability: Using Goniometric Measurements as an Example

Abstract
Statistical methodology for the concurrent assessment of interrater and intrarater reliability is presented. Application of the methodology is illustrated with an example of one therapist using two goniometers repeatedly to measure knee joint angles. Methods for estimating the coefficients, testing hypotheses, constructing confidence intervals, and computing sample size requirements are provided. In addition, the calculation and clinical interpretation of the standard error of measurement (SEM) are discussed. It is recommended that (1) when both interrater and intrarater reliability are being assessed, a repeated-measures design be used to take advantage of the increased precision gained by using all observations in the statistical analysis, and (2) appropriate statistical tests, confidence intervals, and SEMs always be used in conjunction with the estimated reliability coefficients.