Home > Research > Publications & Outputs > Modelling multivariate binary data with alterna...
View graph of relations

Modelling multivariate binary data with alternating logistic regressions.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Modelling multivariate binary data with alternating logistic regressions. / Carey, V.; Zeger, S. L.; Diggle, Peter J.
In: Biometrika, Vol. 80, No. 3, 1993, p. 517-526.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Carey V, Zeger SL, Diggle PJ. Modelling multivariate binary data with alternating logistic regressions. Biometrika. 1993;80(3):517-526. doi: 10.1093/biomet/80.3.517

Author

Carey, V. ; Zeger, S. L. ; Diggle, Peter J. / Modelling multivariate binary data with alternating logistic regressions. In: Biometrika. 1993 ; Vol. 80, No. 3. pp. 517-526.

Bibtex

@article{ea0dd19bfd1e4f21be7934d415863a56,
title = "Modelling multivariate binary data with alternating logistic regressions.",
abstract = "Marginal models for multivariate binary data permit separate modelling of the relationship of the response with explanatory variables, and the association between pairs of responses. When the former is the scientific focus, a first-order generalized estimating equation method (Liang & Zeger, 1986) is easy to implement and gives efficient estimates of regression coefficients, although estimates of the association among the binary outcomes can be inefficient. When the association model is a focus, simultaneous modelling of the responses and all pairwise products (Prentice, 1988) using second-order estimating equations gives more efficient estimates of association parameters as well. However, this procedure can become computationally infeasible as the cluster size gets large. This paper proposes an alternative approach, alternating logistic regressions, for simultaneously regressing the response on explanatory variables as well as modelling the association among responses in terms of pairwise odds ratios. This algorithm iterates between a logistic regression using first-order generalized estimating equations to estimate regression coefficients and a logistic regression of each response on others from the same cluster using an appropriate offset to update the odds ratio parameters. For clusters of size n, alternating logistic regression involves evaluation and inversion of matrices of order n2 rather than n4 as required for second-order generalized estimating equations. The alternating logistic regression estimates are shown to be reasonably efficient relative to solutions of second-order equations in a few problems. The new method is illustrated with an analysis of neuropsychological tests on patients with epileptic seizures.",
keywords = "Clustered data • Generalized estimating equation • Logistic regression",
author = "V. Carey and Zeger, {S. L.} and Diggle, {Peter J.}",
year = "1993",
doi = "10.1093/biomet/80.3.517",
language = "English",
volume = "80",
pages = "517--526",
journal = "Biometrika",
issn = "1464-3510",
publisher = "Oxford University Press",
number = "3",

}

RIS

TY - JOUR

T1 - Modelling multivariate binary data with alternating logistic regressions.

AU - Carey, V.

AU - Zeger, S. L.

AU - Diggle, Peter J.

PY - 1993

Y1 - 1993

N2 - Marginal models for multivariate binary data permit separate modelling of the relationship of the response with explanatory variables, and the association between pairs of responses. When the former is the scientific focus, a first-order generalized estimating equation method (Liang & Zeger, 1986) is easy to implement and gives efficient estimates of regression coefficients, although estimates of the association among the binary outcomes can be inefficient. When the association model is a focus, simultaneous modelling of the responses and all pairwise products (Prentice, 1988) using second-order estimating equations gives more efficient estimates of association parameters as well. However, this procedure can become computationally infeasible as the cluster size gets large. This paper proposes an alternative approach, alternating logistic regressions, for simultaneously regressing the response on explanatory variables as well as modelling the association among responses in terms of pairwise odds ratios. This algorithm iterates between a logistic regression using first-order generalized estimating equations to estimate regression coefficients and a logistic regression of each response on others from the same cluster using an appropriate offset to update the odds ratio parameters. For clusters of size n, alternating logistic regression involves evaluation and inversion of matrices of order n2 rather than n4 as required for second-order generalized estimating equations. The alternating logistic regression estimates are shown to be reasonably efficient relative to solutions of second-order equations in a few problems. The new method is illustrated with an analysis of neuropsychological tests on patients with epileptic seizures.

AB - Marginal models for multivariate binary data permit separate modelling of the relationship of the response with explanatory variables, and the association between pairs of responses. When the former is the scientific focus, a first-order generalized estimating equation method (Liang & Zeger, 1986) is easy to implement and gives efficient estimates of regression coefficients, although estimates of the association among the binary outcomes can be inefficient. When the association model is a focus, simultaneous modelling of the responses and all pairwise products (Prentice, 1988) using second-order estimating equations gives more efficient estimates of association parameters as well. However, this procedure can become computationally infeasible as the cluster size gets large. This paper proposes an alternative approach, alternating logistic regressions, for simultaneously regressing the response on explanatory variables as well as modelling the association among responses in terms of pairwise odds ratios. This algorithm iterates between a logistic regression using first-order generalized estimating equations to estimate regression coefficients and a logistic regression of each response on others from the same cluster using an appropriate offset to update the odds ratio parameters. For clusters of size n, alternating logistic regression involves evaluation and inversion of matrices of order n2 rather than n4 as required for second-order generalized estimating equations. The alternating logistic regression estimates are shown to be reasonably efficient relative to solutions of second-order equations in a few problems. The new method is illustrated with an analysis of neuropsychological tests on patients with epileptic seizures.

KW - Clustered data • Generalized estimating equation • Logistic regression

U2 - 10.1093/biomet/80.3.517

DO - 10.1093/biomet/80.3.517

M3 - Journal article

VL - 80

SP - 517

EP - 526

JO - Biometrika

JF - Biometrika

SN - 1464-3510

IS - 3

ER -