Home > Research > Publications & Outputs > gfpop

Links

Text available via DOI:

View graph of relations

gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection. / Runge, Vincent; Hocking, Toby Dylan; Romano, Gaetano et al.
In: Journal of Statistical Software, Vol. 106, No. 6, 27.03.2023, p. 1–39.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Runge, V, Hocking, TD, Romano, G, Afghah, F, Fearnhead, P & Rigaill, G 2023, 'gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection', Journal of Statistical Software, vol. 106, no. 6, pp. 1–39. https://doi.org/10.18637/jss.v106.i06

APA

Runge, V., Hocking, T. D., Romano, G., Afghah, F., Fearnhead, P., & Rigaill, G. (2023). gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection. Journal of Statistical Software, 106(6), 1–39. https://doi.org/10.18637/jss.v106.i06

Vancouver

Runge V, Hocking TD, Romano G, Afghah F, Fearnhead P, Rigaill G. gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection. Journal of Statistical Software. 2023 Mar 27;106(6):1–39. doi: 10.18637/jss.v106.i06

Author

Runge, Vincent ; Hocking, Toby Dylan ; Romano, Gaetano et al. / gfpop : An R Package for Univariate Graph-Constrained Change-Point Detection. In: Journal of Statistical Software. 2023 ; Vol. 106, No. 6. pp. 1–39.

Bibtex

@article{6d9d2e565ad64a65bc57573cb336b625,
title = "gfpop: An R Package for Univariate Graph-Constrained Change-Point Detection",
abstract = "In a world with data that change rapidly and abruptly, it is important to detect those changes accurately. In this paper we describe an R package implementing a generalized version of an algorithm recently proposed by Hocking, Rigaill, Fearnhead, and Bourque (2020) for penalized maximum likelihood inference of constrained multiple change-point models. This algorithm can be used to pinpoint the precise locations of abrupt changes in large data sequences. There are many application domains for such models, such as medicine, neuroscience or genomics. Often, practitioners have prior knowledge about the changes they are looking for. For example in genomic data, biologists sometimes expect peaks: up changes followed by down changes. Taking advantage of such prior information can substantially improve the accuracy with which we can detect and estimate changes. Hocking et al. (2020) described a graph framework to encode many examples of such prior information and a generic algorithm to infer the optimal model parameters, but implemented the algorithm for just a single scenario. We present the gfpop package that implements the algorithm in a generic manner in R/C++. gfpop works for a user-defined graph that can encode prior assumptions about the types of changes that are possible and implements several loss functions (Gauss, Poisson, binomial, biweight, and Huber). We then illustrate the use of gfpop on isotonic simulations and several applications in biology. For a number of graphs the algorithm runs in a matter of seconds or minutes for 105 data points.",
keywords = "change-point detection, constrained inference, maximum likelihood inference, dynamic programming, robust losses",
author = "Vincent Runge and Hocking, {Toby Dylan} and Gaetano Romano and Fatemeh Afghah and Paul Fearnhead and Guillem Rigaill",
year = "2023",
month = mar,
day = "27",
doi = "10.18637/jss.v106.i06",
language = "English",
volume = "106",
pages = "1–39",
journal = "Journal of Statistical Software",
issn = "1548-7660",
publisher = "University of California at Los Angeles",
number = "6",

}

RIS

TY - JOUR

T1 - gfpop

T2 - An R Package for Univariate Graph-Constrained Change-Point Detection

AU - Runge, Vincent

AU - Hocking, Toby Dylan

AU - Romano, Gaetano

AU - Afghah, Fatemeh

AU - Fearnhead, Paul

AU - Rigaill, Guillem

PY - 2023/3/27

Y1 - 2023/3/27

N2 - In a world with data that change rapidly and abruptly, it is important to detect those changes accurately. In this paper we describe an R package implementing a generalized version of an algorithm recently proposed by Hocking, Rigaill, Fearnhead, and Bourque (2020) for penalized maximum likelihood inference of constrained multiple change-point models. This algorithm can be used to pinpoint the precise locations of abrupt changes in large data sequences. There are many application domains for such models, such as medicine, neuroscience or genomics. Often, practitioners have prior knowledge about the changes they are looking for. For example in genomic data, biologists sometimes expect peaks: up changes followed by down changes. Taking advantage of such prior information can substantially improve the accuracy with which we can detect and estimate changes. Hocking et al. (2020) described a graph framework to encode many examples of such prior information and a generic algorithm to infer the optimal model parameters, but implemented the algorithm for just a single scenario. We present the gfpop package that implements the algorithm in a generic manner in R/C++. gfpop works for a user-defined graph that can encode prior assumptions about the types of changes that are possible and implements several loss functions (Gauss, Poisson, binomial, biweight, and Huber). We then illustrate the use of gfpop on isotonic simulations and several applications in biology. For a number of graphs the algorithm runs in a matter of seconds or minutes for 105 data points.

AB - In a world with data that change rapidly and abruptly, it is important to detect those changes accurately. In this paper we describe an R package implementing a generalized version of an algorithm recently proposed by Hocking, Rigaill, Fearnhead, and Bourque (2020) for penalized maximum likelihood inference of constrained multiple change-point models. This algorithm can be used to pinpoint the precise locations of abrupt changes in large data sequences. There are many application domains for such models, such as medicine, neuroscience or genomics. Often, practitioners have prior knowledge about the changes they are looking for. For example in genomic data, biologists sometimes expect peaks: up changes followed by down changes. Taking advantage of such prior information can substantially improve the accuracy with which we can detect and estimate changes. Hocking et al. (2020) described a graph framework to encode many examples of such prior information and a generic algorithm to infer the optimal model parameters, but implemented the algorithm for just a single scenario. We present the gfpop package that implements the algorithm in a generic manner in R/C++. gfpop works for a user-defined graph that can encode prior assumptions about the types of changes that are possible and implements several loss functions (Gauss, Poisson, binomial, biweight, and Huber). We then illustrate the use of gfpop on isotonic simulations and several applications in biology. For a number of graphs the algorithm runs in a matter of seconds or minutes for 105 data points.

KW - change-point detection

KW - constrained inference

KW - maximum likelihood inference

KW - dynamic programming

KW - robust losses

U2 - 10.18637/jss.v106.i06

DO - 10.18637/jss.v106.i06

M3 - Journal article

VL - 106

SP - 1

EP - 39

JO - Journal of Statistical Software

JF - Journal of Statistical Software

SN - 1548-7660

IS - 6

ER -