A Cautionary Note on Using G²(dif) to Assess Relative Model Fit in Categorical Data Analysis

1 March 2006

journal article
Published by Informa UK Limited in Multivariate Behavioral Research

Vol. 41 (1), 55-64
https://doi.org/10.1207/s15327906mbr4101_4

Abstract

The likelihood ratio test statistic G(2)(dif) is widely used for comparing the fit of nested models in categorical data analysis. In large samples, this statistic is distributed as a chi-square with degrees of freedom equal to the difference in degrees of freedom between the tested models, but only if the least restrictive model is correctly specified. Yet, this statistic is often used in applications without assessing the adequacy of the least restrictive model. This may result in incorrect substantive conclusions as the above large sample reference distribution for G(2)(dif) is no longer appropriate. Rather, its large sample distribution will depend on the degree of model misspecification of the least restrictive model. To illustrate this, a simulation study is performed where this statistic is used to compare nested item response theory models under various degrees of misspecification of the least restrictive model. G(2)(dif) was found to be robust only under small model misspecification of the least restrictive model. Consequently, we argue that some indication of the absolute goodness of fit of the least restrictive model is needed before employing G(2)(dif) to assess relative model fit.

Keywords

This publication has 14 references indexed in Scilit:

Using Graphical Methods in Assessing Measurement Invariance in Inventory Data
Multivariate Behavioral Research, 1999
The Goodness of Fit of Latent Trait Models in Attitude Measurement
Sociological Methods & Research, 1999
Fitting Polytomous Item Response Theory Models to Multiple-Choice Tests
Applied Psychological Measurement, 1995
Goodness-of-Fit Testing for Latent Class Models
Multivariate Behavioral Research, 1993
Item Response Theory
Published by Springer Science and Business Media LLC ,1985
Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm
Psychometrika, 1981
An Empirical Investigation of Goodness-of-Fit Statistics for Sparse Multinomials
Journal of the American Statistical Association, 1980
Small-Sample Comparisons of Exact Levels for Chi-Squared Goodness-of-Fit Statistics
Journal of the American Statistical Association, 1978
Log-Linear Models and Frequency Tables with Small Expected Cell Counts
The Annals of Statistics, 1977
The $\chi^2$ Test of Goodness of Fit
The Annals of Mathematical Statistics, 1952

Cited by 41 articles

A Cautionary Note on Using G2(dif) to Assess Relative Model Fit in Categorical Data Analysis

Abstract

Keywords

A Cautionary Note on Using G²(dif) to Assess Relative Model Fit in Categorical Data Analysis