Building and Supporting a Case for Test Use

Top Cited Papers

1 January 2005

journal article
Published by Taylor & Francis Ltd in Language Assessment Quarterly

Vol. 2 (1), 1-34
https://doi.org/10.1207/s15434311laq0201_1

Abstract

The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how to go about investigating these in the course of practical test development. Argument-based formulations of validity (e.g., Kane, 1992, 2000; Kane, Crooks, & Cohen, 1999; Mislevy, in press; Mislevy, Steinberg, & Almond, 2003) provide a logic and set of procedures for investigating and supporting claims about score-based inferences but do not address issues of test use and the consequences of test use. Recent formulations in language testing (e.g., Bachman & Palmer, 1996; Kunnan, 2003; Lynch, 2001) are essentially lists of more or less independent qualities and questions, with no clear mechanism for integrating these into a set of procedures for test developers and users to follow. What has been called "critical language testing" (e.g, Shohamy, 1999, 2001) has alerted us to the political uses and abuses of language tests and to the need for test developers and test users alike to be self-critical of the ways in which tests are used. However, this perspective treats consequences as essentially unrelated to the validity of inferences and provides little guidance about how to go about either anticipating and avoiding, or redressing, the problems with test use that it discusses. In this article I describe how an argument for test use might be structured so as to provide a clear linkage from test performance to interpretations and from interpretations to uses. An assessment use argument is an overall logical framework for linking assessment performance to use (decisions). This assessment use argument includes two parts: an assessment utilization argument, linking an interpretation to a decision, and an assessment validity argument, which links assessment performance to an interpretation. I then discuss ways in which issues and questions that have been raised by language testers regarding uses, abuses, consequences, validity, and fairness in language testing can provide a basis for articulating claims and counterclaims in an assessment use argument. In my view, an assessment use argument can guide the design and development of assessments and can also lead to a focused, efficient program for collecting the most critical evidence in support of the interpretations and uses for which the assessment is intended.

Keywords

This publication has 29 references indexed in Scilit:

Multiple Measures: Toward Tiered Systems
Educational Measurement: Issues and Practice, 2005
Statistical Analyses for Language Assessment
Published by Cambridge University Press (CUP) ,2004
Substance and structure in assessment arguments
Law, Probability and Risk, 2003
Commentaries
Measurement: Interdisciplinary Research and Perspectives, 2003
Design and analysis in task-based language assessment
Language Testing, 2002
Democratic assessment as an alternative
Language Testing, 2001
Rethinking assessment from a critical perspective
Language Testing, 2001
Language assessment as social practice: challenges for research
Language Testing, 2001
What does test bias have to do with fairness?
Language Testing, 1997
An argument-based approach to validity.
Psychological Bulletin, 1992

Cited by 201 articles