Reliability of performance on standardized patient cases: A comparison of consistency measures based on generalizability theory

1 January 1989

journal article
research article
Published by Informa UK Limited in Teaching and Learning in Medicine

Vol. 1 (1), 31-37
https://doi.org/10.1080/10401338909539375

Abstract

Standardized patient cases have assumed an important role in the assessment of clinical competence in recent years. The reliability (consistency) of performance across standardized patient cases has been determined with consistency measures derived from generalizability theory—namely, the generalizability coefficient, Ep²; the dependability index, ; and the dependability index with cutoff, ϕ(C). These three consistency measures can be computed for quantitatively scored cases and for dichotomously scored cases; hence, six consistency measures could be computed for a given examination. Our purpose was to draw attention to the sizable differences among the computed values of these consistency measures for a new set of clinical competence examination data and to provide a review of the interpretations of the different measures. The findings showed considerable differences among the consistency measures, the number of cases needed to achieve the 0.80 reliability level, and the time required to administer that number of cases. These differences underscore the need to carefully identify the specific consistency measure used in a given study and to attend closely to the interpretation associated with that measure.

Keywords

This publication has 8 references indexed in Scilit:

Assessing Clinical Skills of Residents with Standardized Patients
Annals of Internal Medicine, 1986
A Consumer’s Guide to Setting Performance Standards on Criterion-Referenced Tests
Review of Educational Research, 1986
Errors of Measurement and Standard Setting in Mastery Testing
Applied Psychological Measurement, 1984
A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory
Applied Psychological Measurement, 1980
Agreement Coefficients as Indices of Dependability for Domain-Referenced Tests
Applied Psychological Measurement, 1980
AN INDEX OF DEPENDABILITY FOR MASTERY TESTS
Journal of Educational Measurement, 1977
CRITERION‐REFERENCED APPLICATIONS OF CLASSICAL TEST THEORY ^1,²
Journal of Educational Measurement, 1972
Ability to Avoid Gross Error as a Measure of Achievment
Educational and Psychological Measurement, 1954

Cited by 40 articles