Post-examination analysis of objective tests
- 24 May 2011
- journal article
- research article
- Published by Taylor & Francis Ltd in Medical Teacher
- Vol. 33 (6), 447-458
- https://doi.org/10.3109/0142159x.2011.564682
Abstract
One of the key goals of assessment in medical education is the minimisation of all errors influencing a test in order to produce an observed score which approaches a learner's ‘true’ score, as reliably and validly as possible. In order to achieve this, assessors need to be aware of the potential biases that can influence all components of the assessment cycle from question creation to the interpretation of exam scores. This Guide describes and explains the processes whereby objective examination results can be analysed to improve the validity and reliability of assessments in medical education. We cover the interpretation of measures of central tendency, measures of variability and standard scores. We describe how to calculate the item-difficulty index and item-discrimination index in examination tests using different statistical procedures. This is followed by an overview of reliability estimates. The post-examination analytical methods described in this guide enable medical educators to construct reliable and valid achievement tests. They also enable medical educators to develop question banks using the collection of appropriate questions from existing examination tests in order to use computerised adaptive testing.Keywords
This publication has 14 references indexed in Scilit:
- A primer on classical test theory and item response theory for assessments in medical educationMedical Education, 2010
- Setting and maintaining standards in multiple choice examinations: AMEE Guide No. 37Medical Teacher, 2008
- Understanding Internal Consistency Reliability Estimates: A Conceptual Primer on Coefficient AlphaMeasurement and Evaluation in Counseling and Development, 2001
- Standard setting in medical educationAcademic Medicine, 1996
- What is coefficient alpha? An examination of theory and applications.Journal of Applied Psychology, 1993
- TECHNICAL GUIDELINES FOR ASSESSING COMPUTERIZED ADAPTIVE TESTSJournal of Educational Measurement, 1984
- The subject reacts to tests.American Psychologist, 1967
- Instructional technology and the measurement of learing outcomes: Some questions.American Psychologist, 1963
- Item analysis in relation to educational and psychological testing.Psychological Bulletin, 1952
- The selection of upper and lower groups for the validation of test items.Journal of Educational Psychology, 1939