Estimating propensity scores with missing covariate data using general location mixture models

School Of Mathematical Sciences

Text available via DOI:

https://doi.org/10.1002/sim.4124
Final published version

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Estimating propensity scores with missing covariate data using general location mixture models. / Mitra, Robin; Reiter, Jerome P.
In: Statistics in Medicine, Vol. 30, No. 6, 15.03.2011, p. 627-641.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Mitra, R & Reiter, JP 2011, 'Estimating propensity scores with missing covariate data using general location mixture models', Statistics in Medicine, vol. 30, no. 6, pp. 627-641. https://doi.org/10.1002/sim.4124

APA

Mitra, R., & Reiter, J. P. (2011). Estimating propensity scores with missing covariate data using general location mixture models. Statistics in Medicine, 30(6), 627-641. https://doi.org/10.1002/sim.4124

Vancouver

Mitra R, Reiter JP. Estimating propensity scores with missing covariate data using general location mixture models. Statistics in Medicine. 2011 Mar 15;30(6):627-641. Epub 2010 Dec 28. doi: 10.1002/sim.4124

Author

Mitra, Robin ; Reiter, Jerome P. / Estimating propensity scores with missing covariate data using general location mixture models. In: Statistics in Medicine. 2011 ; Vol. 30, No. 6. pp. 627-641.

Bibtex

@article{ac7412e135af4587b92384bb27922f03,

title = "Estimating propensity scores with missing covariate data using general location mixture models",

abstract = "In many observational studies, analysts estimate causal effects using propensity scores, e.g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities",

author = "Robin Mitra and Reiter, {Jerome P.}",

year = "2011",

month = mar,

day = "15",

doi = "10.1002/sim.4124",

language = "English",

volume = "30",

pages = "627--641",

journal = "Statistics in Medicine",

issn = "0277-6715",

publisher = "John Wiley and Sons Ltd",

number = "6",

}

RIS

TY - JOUR

T1 - Estimating propensity scores with missing covariate data using general location mixture models

AU - Mitra, Robin

AU - Reiter, Jerome P.

PY - 2011/3/15

Y1 - 2011/3/15

N2 - In many observational studies, analysts estimate causal effects using propensity scores, e.g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities

AB - In many observational studies, analysts estimate causal effects using propensity scores, e.g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities

U2 - 10.1002/sim.4124

DO - 10.1002/sim.4124

M3 - Journal article

VL - 30

SP - 627

EP - 641

JO - Statistics in Medicine

JF - Statistics in Medicine

SN - 0277-6715

IS - 6

ER -

Research

Links

Text available via DOI: