Home > Research > Publications & Outputs > Discriminant analysis with singular covariance ...
View graph of relations

Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests. / Jonathan, P.; McCarthy, W.V.; Roberts, A.M.I.
In: Journal of Chemometrics, Vol. 10, No. 3, 1996, p. 189-213.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Jonathan P, McCarthy WV, Roberts AMI. Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests. Journal of Chemometrics. 1996;10(3):189-213. doi: 10.1002/(SICI)1099-128X(199605)10:3<189::AID-CEM410>3.0.CO;2-I

Author

Jonathan, P. ; McCarthy, W.V. ; Roberts, A.M.I. / Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests. In: Journal of Chemometrics. 1996 ; Vol. 10, No. 3. pp. 189-213.

Bibtex

@article{f925c594f89d4144b55c5b01f79b1530,
title = "Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests",
abstract = "A computationally efficient approach has been developed to perform two-group linear discriminant analysis using high-dimensional data. The analysis is based on Fisher's method and incorporates two important validation stages: 1, full leave-one-observation-out cross-validation; 2, randomized permutation distribution testing. The resulting algorithm and software are known as CREDIT (cross-validated random-permutation-tested efficient discrimination based on an adjusted generalized inverse for the sample total covariance matrix). The algorithm has been implemented in the SAS/IML matrix programming language and provides dramatic improvements in computational efficiency compared with existing software for discriminant analysis incorporating validation stages 1 and 2 above. Application of CREDIT to nine multivariate data sets indicates that the predictive performance of the approach, assessed using cross-validation, is comparable with that of other methods for discriminant analysis. Comparisons with two specific methods are included. Randomized permutation tests show that success rates using the true response classes are almost always better than success rates using random permutations of the classes. This gives confidence that there is a useful linear discriminant relationship present in the data being analysed. For a randomly selected training set (used to construct the discriminant rule) the success rates for CREDIT are unbiased predictive success rates for allocating other observations to groups. Predicting group memberships for future observations using any discriminant model based on singular estimates of covariance matrices must be performed with great care. A discussion of methods to test the concordance of future observations with the training set is given.",
keywords = "Concordance, Discriminant analysis, Permutation test, Principal components, QSAR",
author = "P. Jonathan and W.V. McCarthy and A.M.I. Roberts",
year = "1996",
doi = "10.1002/(SICI)1099-128X(199605)10:3<189::AID-CEM410>3.0.CO;2-I",
language = "English",
volume = "10",
pages = "189--213",
journal = "Journal of Chemometrics",
issn = "0886-9383",
publisher = "John Wiley and Sons Ltd",
number = "3",

}

RIS

TY - JOUR

T1 - Discriminant analysis with singular covariance matrices. A method incorporating cross-validation and efficient randomized permutation tests

AU - Jonathan, P.

AU - McCarthy, W.V.

AU - Roberts, A.M.I.

PY - 1996

Y1 - 1996

N2 - A computationally efficient approach has been developed to perform two-group linear discriminant analysis using high-dimensional data. The analysis is based on Fisher's method and incorporates two important validation stages: 1, full leave-one-observation-out cross-validation; 2, randomized permutation distribution testing. The resulting algorithm and software are known as CREDIT (cross-validated random-permutation-tested efficient discrimination based on an adjusted generalized inverse for the sample total covariance matrix). The algorithm has been implemented in the SAS/IML matrix programming language and provides dramatic improvements in computational efficiency compared with existing software for discriminant analysis incorporating validation stages 1 and 2 above. Application of CREDIT to nine multivariate data sets indicates that the predictive performance of the approach, assessed using cross-validation, is comparable with that of other methods for discriminant analysis. Comparisons with two specific methods are included. Randomized permutation tests show that success rates using the true response classes are almost always better than success rates using random permutations of the classes. This gives confidence that there is a useful linear discriminant relationship present in the data being analysed. For a randomly selected training set (used to construct the discriminant rule) the success rates for CREDIT are unbiased predictive success rates for allocating other observations to groups. Predicting group memberships for future observations using any discriminant model based on singular estimates of covariance matrices must be performed with great care. A discussion of methods to test the concordance of future observations with the training set is given.

AB - A computationally efficient approach has been developed to perform two-group linear discriminant analysis using high-dimensional data. The analysis is based on Fisher's method and incorporates two important validation stages: 1, full leave-one-observation-out cross-validation; 2, randomized permutation distribution testing. The resulting algorithm and software are known as CREDIT (cross-validated random-permutation-tested efficient discrimination based on an adjusted generalized inverse for the sample total covariance matrix). The algorithm has been implemented in the SAS/IML matrix programming language and provides dramatic improvements in computational efficiency compared with existing software for discriminant analysis incorporating validation stages 1 and 2 above. Application of CREDIT to nine multivariate data sets indicates that the predictive performance of the approach, assessed using cross-validation, is comparable with that of other methods for discriminant analysis. Comparisons with two specific methods are included. Randomized permutation tests show that success rates using the true response classes are almost always better than success rates using random permutations of the classes. This gives confidence that there is a useful linear discriminant relationship present in the data being analysed. For a randomly selected training set (used to construct the discriminant rule) the success rates for CREDIT are unbiased predictive success rates for allocating other observations to groups. Predicting group memberships for future observations using any discriminant model based on singular estimates of covariance matrices must be performed with great care. A discussion of methods to test the concordance of future observations with the training set is given.

KW - Concordance

KW - Discriminant analysis

KW - Permutation test

KW - Principal components

KW - QSAR

U2 - 10.1002/(SICI)1099-128X(199605)10:3<189::AID-CEM410>3.0.CO;2-I

DO - 10.1002/(SICI)1099-128X(199605)10:3<189::AID-CEM410>3.0.CO;2-I

M3 - Journal article

VL - 10

SP - 189

EP - 213

JO - Journal of Chemometrics

JF - Journal of Chemometrics

SN - 0886-9383

IS - 3

ER -