Propensity score matching with missing covariates via iterated, sequential multiple imputation

Mathematics and Statistics

Keywords

missing data, multiple imputation, observational studies, propensity scores

Research output: Working paper

Published

Robin Mitra
Jerome P. Reiter

Publication date	1/04/2011
Original language	English

Abstract

In many observational studies, analysts estimate causal effects using propensity score matching. Estimation of propensity scores is complicated when covariate values intended for collection are in fact missing. To handle the missing data, one approach is to use multiple imputation to create completed datasets, and compute propensity scores from these datasets. However, inaccurate imputation models can result in ineffective matching, thereby limiting reductions in bias. We propose a multiple imputation approach based on chained equations in which the researcher gradually reduces the set of control units used to estimate the imputation models. This approach can reduce the influence of control records far from the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations and better balance in the true covariate distributions. This approach can be conveniently implemented with standard multiple imputation software for missing data. Using simulations, we find that the approach can improve estimation when imputation models are mis-specified; however, it can be ineffective when imputation models are correctly specified. This suggests using the approach as part of sensitivity analysis in causal inference. We apply the approach to an observational study of the effect of breast-feeding on the child's educational outcomes later in life.
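
The abstract does not spell out the algorithm, so the following is only a loose sketch of the general idea (impute, estimate propensity scores, shrink the control pool, re-impute), not the authors' procedure. Everything in it is an assumption made for illustration: the function names, the single partially missing covariate x2, the linear imputation model, the rule of keeping a fixed fraction (keep_frac) of the controls nearest to the treated units on the logit propensity score, and the simulated data. It also draws only residual noise, whereas proper multiple imputation would draw the model parameters as well and repeat the process to produce several completed datasets.

```python
import numpy as np

def fit_logistic(X, y, iters=25, ridge=1e-6):
    """Logistic regression by Newton's method; X must include an intercept column."""
    beta = np.zeros(X.shape[1])
    for _ in range(iters):
        eta = np.clip(X @ beta, -30, 30)
        p = 1.0 / (1.0 + np.exp(-eta))
        w = p * (1.0 - p)
        # Small ridge term keeps the Hessian invertible (illustrative choice).
        hess = X.T @ (X * w[:, None]) + ridge * np.eye(X.shape[1])
        beta = beta + np.linalg.solve(hess, X.T @ (y - p))
    return beta

def impute_once(x1, x2, treat, pool, rng):
    """Fill missing x2 from a linear model on x1 and treat, fitted only on units in `pool`."""
    obs = ~np.isnan(x2)
    fit = obs & pool
    A = np.column_stack([np.ones(fit.sum()), x1[fit], treat[fit]])
    coef, *_ = np.linalg.lstsq(A, x2[fit], rcond=None)
    resid_sd = np.std(x2[fit] - A @ coef)
    miss = ~obs
    A_miss = np.column_stack([np.ones(miss.sum()), x1[miss], treat[miss]])
    filled = x2.copy()
    # Simplification: adds residual noise only, with coefficients held fixed.
    filled[miss] = A_miss @ coef + rng.normal(0.0, resid_sd, miss.sum())
    return filled

def shrinking_pool_imputation(x1, x2, treat, rounds=3, keep_frac=0.7, seed=0):
    """Alternate imputation and control-pool reduction (hypothetical reduction rule)."""
    rng = np.random.default_rng(seed)
    treated = treat == 1
    pool = np.ones(len(treat), dtype=bool)  # start with every unit
    pool_sizes = []
    for _ in range(rounds):
        filled = impute_once(x1, x2, treat, pool, rng)
        X = np.column_stack([np.ones(len(treat)), x1, filled])
        beta = fit_logistic(X[pool], treat[pool].astype(float))
        score = X @ beta  # logit of the estimated propensity score
        ctrl_idx = np.flatnonzero(pool & ~treated)
        # Distance from each pooled control to its nearest treated unit.
        dist = np.abs(score[ctrl_idx, None] - score[treated][None, :]).min(axis=1)
        n_keep = max(int(treated.sum()), int(keep_frac * len(ctrl_idx)))
        keep = ctrl_idx[np.argsort(dist)[:n_keep]]
        pool = treated.copy()  # treated units always stay in the pool
        pool[keep] = True
        pool_sizes.append(int(pool.sum()))
    # Final imputation uses only the treated units and the retained controls.
    completed = impute_once(x1, x2, treat, pool, rng)
    return completed, pool, pool_sizes

# Simulated example data (entirely made up for this sketch).
rng = np.random.default_rng(1)
n = 400
x1 = rng.normal(size=n)
x2_true = 0.5 * x1 + rng.normal(size=n)
p_treat = 1.0 / (1.0 + np.exp(1.0 - 0.8 * x1 - 0.8 * x2_true))
treat = (rng.random(n) < p_treat).astype(int)
x2 = np.where(rng.random(n) < 0.3, np.nan, x2_true)  # about 30% missing at random

completed, pool, sizes = shrinking_pool_imputation(x1, x2, treat)
```

Here completed is one completed copy of x2, pool flags the units used to fit the final imputation model, and sizes records how the pool shrinks across rounds. Running the function with several seeds would give multiple completed datasets, each of which could then feed a propensity score match.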
