Home > Research > Publications & Outputs > Subset Multivariate Collective And Point Anomal...

Electronic data

  • MVCAPA_JCGS_Revision-6

    Rights statement: This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Computational and Graphical Statistics on 19/11/2021, available online: https://www.tandfonline.com/doi/full/10.1080/10618600.2021.1987257

    Accepted author manuscript, 2.76 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Subset Multivariate Collective And Point Anomaly Detection

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Subset Multivariate Collective And Point Anomaly Detection. / Fisch, Alex; Eckley, Idris; Fearnhead, Paul.
In: Journal of Computational and Graphical Statistics, Vol. 31, No. 2, 30.06.2022, p. 574-585.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Fisch A, Eckley I, Fearnhead P. Subset Multivariate Collective And Point Anomaly Detection. Journal of Computational and Graphical Statistics. 2022 Jun 30;31(2):574-585. Epub 2021 Nov 19. doi: 10.1080/10618600.2021.1987257

Author

Fisch, Alex ; Eckley, Idris ; Fearnhead, Paul. / Subset Multivariate Collective And Point Anomaly Detection. In: Journal of Computational and Graphical Statistics. 2022 ; Vol. 31, No. 2. pp. 574-585.

Bibtex

@article{dbdf814a291c404199ece59e40c112ea,
title = "Subset Multivariate Collective And Point Anomaly Detection",
abstract = "In recent years, there has been a growing interest in identifying anomalous structure within multivariate data sequences. We consider the problem of detecting collective anomalies, corresponding to intervals where one, or more, of the data sequences behaves anomalously. We first develop a test for a single collective anomaly that has power to simultaneously detect anomalies that are either rare, that is affecting few data sequences, or common. We then show how to detect multiple anomalies in a way that is computationally efficient but avoids the approximations inherent in binary segmentation-like approaches. This approach is shown to consistently estimate the number and location of the collective anomalies -- a property that has not previously been shown for competing methods. Our approach can be made robust to point anomalies and can allow for the anomalies to be imperfectly aligned. We show the practical usefulness of allowing for imperfect alignments through a resulting increase in power to detect regions of copy number variation.",
keywords = "Copy number variations, Dynamic programming, Epidemic changepoints, Outliers, Robust statistics",
author = "Alex Fisch and Idris Eckley and Paul Fearnhead",
note = "This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Computational and Graphical Statistics on 19/11/2021, available online: https://www.tandfonline.com/doi/full/10.1080/10618600.2021.1987257",
year = "2022",
month = jun,
day = "30",
doi = "10.1080/10618600.2021.1987257",
language = "English",
volume = "31",
pages = "574--585",
journal = "Journal of Computational and Graphical Statistics",
issn = "1061-8600",
publisher = "American Statistical Association",
number = "2",

}

RIS

TY - JOUR

T1 - Subset Multivariate Collective And Point Anomaly Detection

AU - Fisch, Alex

AU - Eckley, Idris

AU - Fearnhead, Paul

N1 - This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Computational and Graphical Statistics on 19/11/2021, available online: https://www.tandfonline.com/doi/full/10.1080/10618600.2021.1987257

PY - 2022/6/30

Y1 - 2022/6/30

N2 - In recent years, there has been a growing interest in identifying anomalous structure within multivariate data sequences. We consider the problem of detecting collective anomalies, corresponding to intervals where one, or more, of the data sequences behaves anomalously. We first develop a test for a single collective anomaly that has power to simultaneously detect anomalies that are either rare, that is affecting few data sequences, or common. We then show how to detect multiple anomalies in a way that is computationally efficient but avoids the approximations inherent in binary segmentation-like approaches. This approach is shown to consistently estimate the number and location of the collective anomalies -- a property that has not previously been shown for competing methods. Our approach can be made robust to point anomalies and can allow for the anomalies to be imperfectly aligned. We show the practical usefulness of allowing for imperfect alignments through a resulting increase in power to detect regions of copy number variation.

AB - In recent years, there has been a growing interest in identifying anomalous structure within multivariate data sequences. We consider the problem of detecting collective anomalies, corresponding to intervals where one, or more, of the data sequences behaves anomalously. We first develop a test for a single collective anomaly that has power to simultaneously detect anomalies that are either rare, that is affecting few data sequences, or common. We then show how to detect multiple anomalies in a way that is computationally efficient but avoids the approximations inherent in binary segmentation-like approaches. This approach is shown to consistently estimate the number and location of the collective anomalies -- a property that has not previously been shown for competing methods. Our approach can be made robust to point anomalies and can allow for the anomalies to be imperfectly aligned. We show the practical usefulness of allowing for imperfect alignments through a resulting increase in power to detect regions of copy number variation.

KW - Copy number variations

KW - Dynamic programming

KW - Epidemic changepoints

KW - Outliers

KW - Robust statistics

U2 - 10.1080/10618600.2021.1987257

DO - 10.1080/10618600.2021.1987257

M3 - Journal article

VL - 31

SP - 574

EP - 585

JO - Journal of Computational and Graphical Statistics

JF - Journal of Computational and Graphical Statistics

SN - 1061-8600

IS - 2

ER -