Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

School of Mathematical Sciences

Electronic data

wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores
Final published version, 536 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.3102/10769986231161575
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores. / Wallin, Gabriel; Wiberg, Marie.
In: Journal of Educational and Behavioral Statistics, Vol. 48, No. 5, 31.10.2023, p. 603-635.

Harvard

Wallin, G & Wiberg, M 2023, 'Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores', Journal of Educational and Behavioral Statistics, vol. 48, no. 5, pp. 603-635. https://doi.org/10.3102/10769986231161575

APA

Wallin, G., & Wiberg, M. (2023). Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores. Journal of Educational and Behavioral Statistics, 48(5), 603-635. https://doi.org/10.3102/10769986231161575

Vancouver

Wallin G, Wiberg M. Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores. Journal of Educational and Behavioral Statistics. 2023 Oct 31;48(5):603-635. Epub 2023 May 9. doi: 10.3102/10769986231161575

Author

Wallin, Gabriel ; Wiberg, Marie. / Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores. In: Journal of Educational and Behavioral Statistics. 2023 ; Vol. 48, No. 5. pp. 603-635.

Bibtex

@article{d3e224ae8cd74ccaad2fd76b375ec348,

title = "Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores",

abstract = "This study explores the usefulness of covariates for equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the propensity score model. The study assumes a parametric form of the propensity score and evaluates the effects of various misspecification scenarios on equating error. The results, based on both simulated and real testing data, show that (1) omitting an important covariate leads to biased estimates of the equated scores, (2) misspecifying a nonlinear relationship between the covariates and test scores increases the equating standard error in the tails of the score distributions, and (3) the equating estimators are robust against omitting a second-order term as well as using an incorrect link function in the propensity score estimation model. The findings demonstrate that auxiliary information is beneficial for test score equating in complex settings. However, they also shed light on the challenge of making fair comparisons between nonequivalent test groups in the absence of common items. The study identifies scenarios where equating performance is acceptable and where it is problematic, provides practical guidelines, and identifies areas for further investigation.",

author = "Gabriel Wallin and Marie Wiberg",

year = "2023",

month = oct,

day = "31",

doi = "10.3102/10769986231161575",

language = "English",

volume = "48",

pages = "603--635",

journal = "Journal of Educational and Behavioral Statistics",

publisher = "SAGE Publications Inc.",

number = "5",

}

RIS

TY - JOUR

T1 - Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

AU - Wallin, Gabriel

AU - Wiberg, Marie

PY - 2023/10/31

Y1 - 2023/10/31

N2 - This study explores the usefulness of covariates for equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the propensity score model. The study assumes a parametric form of the propensity score and evaluates the effects of various misspecification scenarios on equating error. The results, based on both simulated and real testing data, show that (1) omitting an important covariate leads to biased estimates of the equated scores, (2) misspecifying a nonlinear relationship between the covariates and test scores increases the equating standard error in the tails of the score distributions, and (3) the equating estimators are robust against omitting a second-order term as well as using an incorrect link function in the propensity score estimation model. The findings demonstrate that auxiliary information is beneficial for test score equating in complex settings. However, they also shed light on the challenge of making fair comparisons between nonequivalent test groups in the absence of common items. The study identifies scenarios where equating performance is acceptable and where it is problematic, provides practical guidelines, and identifies areas for further investigation.

AB - This study explores the usefulness of covariates for equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the propensity score model. The study assumes a parametric form of the propensity score and evaluates the effects of various misspecification scenarios on equating error. The results, based on both simulated and real testing data, show that (1) omitting an important covariate leads to biased estimates of the equated scores, (2) misspecifying a nonlinear relationship between the covariates and test scores increases the equating standard error in the tails of the score distributions, and (3) the equating estimators are robust against omitting a second-order term as well as using an incorrect link function in the propensity score estimation model. The findings demonstrate that auxiliary information is beneficial for test score equating in complex settings. However, they also shed light on the challenge of making fair comparisons between nonequivalent test groups in the absence of common items. The study identifies scenarios where equating performance is acceptable and where it is problematic, provides practical guidelines, and identifies areas for further investigation.

U2 - 10.3102/10769986231161575

DO - 10.3102/10769986231161575

M3 - Journal article

VL - 48

SP - 603

EP - 635

JO - Journal of Educational and Behavioral Statistics

JF - Journal of Educational and Behavioral Statistics

IS - 5

ER -
