Home > Research > Publications & Outputs > Optimal design when outcome values are not miss...

Electronic data

  • SS-2016-0526_Preprint

    Accepted author manuscript, 2.66 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

  • A28n48

    Final published version, 414 KB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Optimal design when outcome values are not missing at random

Research output: Contribution to journalJournal articlepeer-review

Published
Close
<mark>Journal publication date</mark>1/10/2018
<mark>Journal</mark>Statistica Sinica
Issue number4
Volume28
Number of pages18
Pages (from-to)1821-1838
Publication StatusPublished
Early online date1/04/17
<mark>Original language</mark>English

Abstract

The presence of missing values complicates statistical analyses. In design of experiments, missing values are particularly problematic when constructing optimal designs, as it is not known which values are missing at the design stage. When data are missing at random it is possible to incorporate this information into the optimality criterion that is used to find designs; Imhof, Song and Wong (2002) develop such a framework. However, when data are not missing at random this framework can lead to inefficient designs. We investigate and address the specific challenges that not missing at random values present when finding optimal designs for linear regression models. We show that the optimality criteria will depend on model parameters that traditionally do not affect the design, such as regression coefficients and the residual variance. We also develop a framework that improves efficiency of designs over those found assuming values are missing at random.