Likelihood-Based Item-Fit Indices for Dichotomous Item Response Theory Models

Top Cited Papers

1 March 2000

journal article
Published by SAGE Publications in Applied Psychological Measurement

Vol. 24 (1), 50-64
https://doi.org/10.1177/01466216000241003

Abstract

New goodness-of-fit indices are introduced for dichotomous item response theory (IRT) models. These indices are based on the likelihoods of number-correct scores derived from the IRT model, and they provide a direct comparison of the modeled and observed frequencies for correct and incorrect responses for each number-correct score. The behavior of Pearson’s X² ( S- X²) and the likelihood ratio G² ( S- G²) was assessed in a simulation study and compared with two fit indices similar to those currently in use ( Q1- X² and Q₁- G²). The simulations included three conditions in which the simulating and fitting models were identical and three conditions involving model misspecification. S- X² performed well, with Type I error rates close to the expected .05 and .01 levels. Performance of this index improved with increased test length. S- G² tended to reject the null hypothesis too often, as did Q₁- X² and Q₁- G². The power of S- X² appeared to be similar for all test lengths, but varied depending on the type of model misspecification.

Keywords

This publication has 19 references indexed in Scilit:

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses
Applied Psychological Measurement, 1995
A Conditional Item-Fit Index for Rasch Models
Applied Psychological Measurement, 1994
A Monte Carlo Investigation of Several Person and Item Fit Statistics for Item Response Models
Applied Psychological Measurement, 1987
The Analysis of Item-Ability Regressions: An Exploratory IRT Model Fit Tool
Applied Psychological Measurement, 1985
A Comparison of Several Goodness-of-Fit Statistics
Applied Psychological Measurement, 1985
Comparison of IRT True-Score and Equipercentile Observed-Score "Equatings"
Applied Psychological Measurement, 1984
Using Simulation Results to Choose a Latent Trait Model
Applied Psychological Measurement, 1981
Small-Sample Comparisons of Exact Levels for Chi-Squared Goodness-of-Fit Statistics
Journal of the American Statistical Association, 1978
An Investigation of the Restraints with Respect to Sample Size Commonly Imposed on the Use of the Chi-Square Statistic
Journal of the American Statistical Association, 1971
A Procedure for Sample-Free Item Analysis
Educational and Psychological Measurement, 1969

Cited by 453 articles