Back to overview

An Evaluation of Overall Goodness-of-Fit Tests for the Rasch Model

Type of publication Peer-reviewed
Publikationsform Original article (peer-reviewed)
Author Debelak Rudolf,
Project Detecting Heterogeneity in Complex IRT Models for Measuring Latent Traits
Show all

Original article (peer-reviewed)

Journal Frontiers in Psychology
Volume (Issue) 9
Title of proceedings Frontiers in Psychology
DOI 10.3389/fpsyg.2018.02710

Open Access

Type of Open Access Publisher (Gold Open Access)


For assessing the fit of item response theory models, it has been suggested to apply overall goodness-of-fit tests as well as tests for individual items and item pairs. Although numerous goodness-of-fit tests have been proposed in the literature for the Rasch model, their relative power against several model violations has not been investigated so far. This study compares four of these tests, which are all available in R software: T10, T11, M2, and the LR test. Results on the Type I error rate and the sensitivity to violations of different assumptions of the Rasch model (unidimensionality, local independence on the level of item pairs, equal item discrimination, zero as a lower asymptote for the item characteristic curves, invariance of the item parameters) are reported. The results indicate that the T11 test is comparatively most powerful against violations of the assumption of parallel item characteristic curves, which includes the presence of unequal item discriminations and a non-zero lower asymptote. Against the remaining model violations, which can be summarized as local dependence, M2 is found to be most powerful. T10 and LR are found to be sensitive against violations of the assumption of parallel item characteristic curves, but are insensitive against local dependence.