Diagnosing Tests: Using and Misusing Diagnostic and Screening Tests

Top Cited Papers

1 December 2003

journal article
Published by Informa UK Limited in Journal of Personality Assessment

Vol. 81 (3), 209-219
https://doi.org/10.1207/s15327752jpa8103_03

Abstract

Tests can be used either diagnostically (i.e., to confirm or rule out the presence of a condition in people suspected of having it) or as a screening instrument (determining who in a large group of people has the condition and often when those people are unaware of it or unwilling to admit to it). Tests that may be useful and accurate for diagnosis may actually do more harm than good when used as a screening instrument. The reason is that the proportion of false negatives may be high when the prevalence is high, and the proportion of false positives tends to be high when the prevalence of the condition is low (the usual situation with screening tests). My first aim of this article is to discuss the effects of the base rate, or prevalence, of a disorder on the accuracy of test results. My second aim is to review some of the many diagnostic efficiency statistics that can be derived from a 2 x 2 table, including the overall correct classification rate, kappa, phi, the odds ratio, positive and negative predictive power and some variants of them, and likelihood ratios. In the last part of this article, I review the recent Standards for Reporting of Diagnostic Accuracy guidelines (Bossuyt et al., 2003) for reporting the results of diagnostic tests and extend them to cover the types of tests used by psychologists.

Keywords

This publication has 38 references indexed in Scilit:

Towards Complete and Accurate Reporting of Studies of Diagnostic Accuracy: The STARD Initiative
Annals of Internal Medicine, 2003
Development of a Short Leyton Obsessional Inventory for Children and Adolescents
Journal of the American Academy of Child & Adolescent Psychiatry, 2002
The detection of feigned uncoached and coached posttraumatic stress disorder with the MMPI-2 in a sample of workplace accident victims.
Psychological Assessment, 2002
Comparison of Beck Depression Inventories-IA and-II in Psychiatric Outpatients
Journal of Personality Assessment, 1996
Race-of-Interviewer Effects in a Preelection Poll: Virginia 1989
Public Opinion Quarterly, 1991
The usefulness of the Denver Developmental Screening Test to predict kindergarten problems in a general community population.
American Journal of Public Health, 1984
Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit.
Psychological Bulletin, 1968
The Morbidity of Cardiac Nondisease in Schoolchildren
New England Journal of Medicine, 1967
A Coefficient of Agreement for Nominal Scales
Educational and Psychological Measurement, 1960
Convergent and discriminant validation by the multitrait-multimethod matrix.
Psychological Bulletin, 1959

Cited by 224 articles