Smoothing of bivariate test score distributions - Model selection targeting test score equating - Research Portal

Home > Research > Publications & Outputs > Smoothing of bivariate test score distributions...

School Of Mathematical Sciences

Electronic data

Presmoothing_selection_in_OSE
Accepted author manuscript, 658 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.3102/10769986241300523
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

View graph of relations

Smoothing of bivariate test score distributions - Model selection targeting test score equating

Research output: Contribution to Journal/Magazine › Journal article › peer-review

E-pub ahead of print

Gabriel Wallin
Marie Wiberg

More...

<mark>Journal publication date</mark>	3/01/2025
<mark>Journal</mark>	Journal of Educational and Behavioral Statistics
Publication Status	E-pub ahead of print
Early online date	3/01/25
<mark>Original language</mark>	English

Abstract

Observed-score test equating is a vital part of every testing program, aiming to make test scores across test administrations comparable. Central to this process is the equating function, typically estimated by composing distribution functions of the scores to be equated. An integral part of this estimation is presmoothing, where statistical models are fit to observed score frequencies to mitigate sampling variability. This study evaluates the impact of commonly used model fit indices on bivariate presmoothing model-selection accuracy in both item response theory (IRT) and non-IRT settings. It also introduces a new model-selection criterion that directly targets the equating function in contrast to existing methods. The study focuses on the framework of non-equivalent groups with anchor test design, estimating bivariate score distributions based on real and simulated data. Results show that the choice of presmoothing model and model fit criterion influences the equated scores. In non-IRT contexts, a combination of the proposed model-selection criterion and the Bayesian information criterion exhibited superior performance, balancing bias, and variance of the equated scores. For IRT models, high selection accuracy and minimal equating error were achieved across all scenarios.

Research

Electronic data

Links

Text available via DOI:

Smoothing of bivariate test score distributions - Model selection targeting test score equating

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us

Research

Electronic data

Links

Text available via DOI:

Smoothing of bivariate test score distributions - Model selection targeting test score equating

Abstract

Related research outputs

DIF Analysis with Unknown Groups and Anchor Items

Quick Links

Connect With Us

Faculties & Depts

Contact Us