Home > Research > Publications & Outputs > Propensity score matching with missing covariat...
View graph of relations

Propensity score matching with missing covariates via iterated, sequential multiple imputation

Research output: Working paper

Published
Close
Publication date1/04/2011
<mark>Original language</mark>English

Abstract

In many observational studies, analysts estimate causal effects using propensity score matching. Estimation of propensity scores is complicated when covariate values intended for collection are in fact missing. To handle the missing data, one approach is to use multiple imputation to create completed datasets, and compute propensity scores from these datasets. However, inaccurate imputation models can result in ineffective matching, thereby limiting reductions in bias. We propose a multiple imputation approach based on chained equations in which the researcher gradually reduces the set of control units used to estimate the imputation models. This approach can reduce the influence of control records far from the treated units? region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations and better balance in the true covariate distributions. This approach can be conveniently implemented with standard multiple imputation software for missing data. Using simulations, we find that the approach can improve estimation when imputation models are mis-specified; however, it can be ineffective when imputation models are correctly specified. This suggests using the approach as part of sensitivity analysis in causal inference. We apply the approach to an observational study of the effect of breast-feeding on the child?s educational outcomes later in life.