Post-examination analysis of objective tests

24 May 2011

journal article
research article
Published by Taylor & Francis Ltd in Medical Teacher

Vol. 33 (6), 447-458
https://doi.org/10.3109/0142159x.2011.564682

Abstract

A key goal of assessment in medical education is to minimise the errors that influence a test, so that the observed score approaches a learner's ‘true’ score as reliably and validly as possible. To achieve this, assessors need to be aware of the potential biases that can affect every component of the assessment cycle, from question creation to the interpretation of exam scores. This Guide describes and explains how objective examination results can be analysed to improve the validity and reliability of assessments in medical education. We cover the interpretation of measures of central tendency, measures of variability and standard scores. We then describe how to calculate the item-difficulty index and the item-discrimination index for examination items using different statistical procedures, followed by an overview of reliability estimates. The post-examination analytical methods described in this Guide enable medical educators to construct reliable and valid achievement tests. They also enable medical educators to build question banks by collecting appropriate questions from existing examinations, which can then support computerised adaptive testing.
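As a rough illustration of the quantities named in the abstract, the sketch below computes an item-difficulty index, an item-discrimination index and a reliability estimate for a small set of right/wrong responses. The abstract does not say which procedures the Guide uses, so the choices here are assumptions: difficulty is taken as the proportion of correct answers, discrimination as the upper-minus-lower group difference with 27% groups, and reliability as KR-20. The function names and the response data are hypothetical.

```python
from statistics import pvariance

# Hypothetical data: one row per examinee, one column per item (1 = correct).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]


def item_difficulty(rows, item):
    """Proportion of examinees who answered the item correctly (0 to 1)."""
    return sum(row[item] for row in rows) / len(rows)


def item_discrimination(rows, item, fraction=0.27):
    """Upper-minus-lower index: proportion correct in the top-scoring group
    minus proportion correct in the bottom-scoring group.
    The 27% group size is an assumed convention, not taken from the source."""
    ranked = sorted(rows, key=sum, reverse=True)
    k = max(1, round(len(ranked) * fraction))
    upper, lower = ranked[:k], ranked[-k:]
    p_upper = sum(row[item] for row in upper) / k
    p_lower = sum(row[item] for row in lower) / k
    return p_upper - p_lower


def kr20(rows):
    """Kuder-Richardson 20 reliability estimate for dichotomous items,
    using the population variance of the total scores."""
    n_items = len(rows[0])
    total_variance = pvariance([sum(row) for row in rows])
    sum_pq = 0.0
    for item in range(n_items):
        p = item_difficulty(rows, item)
        sum_pq += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - sum_pq / total_variance)


if __name__ == "__main__":
    for item in range(len(responses[0])):
        print(item, item_difficulty(responses, item),
              item_discrimination(responses, item))
    print("KR-20:", kr20(responses))
```

With very small samples, as here, the upper and lower groups contain only a couple of examinees, so the discrimination values are coarse; real examination data would give more stable figures.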

Cited by 104 articles