Home > Research > Publications & Outputs > Methods for extreme value threshold selection a...

Electronic data

  • 2025MurphyPhD

    Final published version, 7.79 MB, PDF document

    Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Text available via DOI:

View graph of relations

Methods for extreme value threshold selection and uncertainty quantification with application to induced seismicity

Research output: ThesisDoctoral Thesis

Published

Standard

Harvard

APA

Vancouver

Author

Bibtex

@phdthesis{662cbc767cda41ae813bab6d4fbdc195,
title = "Methods for extreme value threshold selection and uncertainty quantification with application to induced seismicity",
abstract = "Designing protection mechanisms to safeguard against ever-changing environmental processes requires the accurate estimation of future extreme hazards combined with reliable measures of their uncertainty. This thesis uses the peaks-over-threshold (POT) framework of extreme value modelling to provide high-quality tail inferences. The fundamental problem with POT analyses is selecting the threshold to characterise extreme values, with inferences sensitive to the choice. We develop improved methodology for threshold selection and for quantifying the uncertainty of this selection in tail inferences.Even for independent and identically distributed data, threshold selection is a difficult task. Existing approaches can be subjective, sensitive to tuning parameters, or rely on asymptotics, resulting in suboptimal performance in practice. We develop a novel, objective, and effective methodology to automate threshold selection and propagate its uncertainty through to high quantile inference. We extend these approaches to handle non-identically distributed data with smooth generalised additive model formulations for the threshold and excess distribution parameters. We adapt the methodology to address requirements for important applications. For coastal flooding, we focus the goodness-of-fit metric on the upper tail to ensure that the selected thresholds lead to accurate fitting to the most extreme observations. For modelling induced earthquakes, we use geophysical covariates regarding the measurement network and stresses induced by gas extraction, to form spatio-temporal threshold and excess distribution parameter functions. We develop a powerful estimator for a key quantity in seismicity modelling, the magnitude of completion. This estimator reduces the uncertainty and provides stronger evidence of a finite upper-endpoint than in previous research. We expand our uncertainty algorithms to account for the unknown model-covariate formulation and incorporate this uncertainty in inference for future endpoint summaries and quantile estimates relevant for design standards. Our methods have much wider applicability for inference for other induced seismicity contexts and wider environmental hazards. ",
keywords = "extreme value theory, threshold selection, uncertainty quantification, induced seismicity, coastal flooding, generalised Pareto distribution",
author = "Conor Murphy",
year = "2025",
doi = "10.17635/lancaster/thesis/2909",
language = "English",
publisher = "Lancaster University",
school = "Lancaster University",

}

RIS

TY - BOOK

T1 - Methods for extreme value threshold selection and uncertainty quantification with application to induced seismicity

AU - Murphy, Conor

PY - 2025

Y1 - 2025

N2 - Designing protection mechanisms to safeguard against ever-changing environmental processes requires the accurate estimation of future extreme hazards combined with reliable measures of their uncertainty. This thesis uses the peaks-over-threshold (POT) framework of extreme value modelling to provide high-quality tail inferences. The fundamental problem with POT analyses is selecting the threshold to characterise extreme values, with inferences sensitive to the choice. We develop improved methodology for threshold selection and for quantifying the uncertainty of this selection in tail inferences.Even for independent and identically distributed data, threshold selection is a difficult task. Existing approaches can be subjective, sensitive to tuning parameters, or rely on asymptotics, resulting in suboptimal performance in practice. We develop a novel, objective, and effective methodology to automate threshold selection and propagate its uncertainty through to high quantile inference. We extend these approaches to handle non-identically distributed data with smooth generalised additive model formulations for the threshold and excess distribution parameters. We adapt the methodology to address requirements for important applications. For coastal flooding, we focus the goodness-of-fit metric on the upper tail to ensure that the selected thresholds lead to accurate fitting to the most extreme observations. For modelling induced earthquakes, we use geophysical covariates regarding the measurement network and stresses induced by gas extraction, to form spatio-temporal threshold and excess distribution parameter functions. We develop a powerful estimator for a key quantity in seismicity modelling, the magnitude of completion. This estimator reduces the uncertainty and provides stronger evidence of a finite upper-endpoint than in previous research. We expand our uncertainty algorithms to account for the unknown model-covariate formulation and incorporate this uncertainty in inference for future endpoint summaries and quantile estimates relevant for design standards. Our methods have much wider applicability for inference for other induced seismicity contexts and wider environmental hazards.

AB - Designing protection mechanisms to safeguard against ever-changing environmental processes requires the accurate estimation of future extreme hazards combined with reliable measures of their uncertainty. This thesis uses the peaks-over-threshold (POT) framework of extreme value modelling to provide high-quality tail inferences. The fundamental problem with POT analyses is selecting the threshold to characterise extreme values, with inferences sensitive to the choice. We develop improved methodology for threshold selection and for quantifying the uncertainty of this selection in tail inferences.Even for independent and identically distributed data, threshold selection is a difficult task. Existing approaches can be subjective, sensitive to tuning parameters, or rely on asymptotics, resulting in suboptimal performance in practice. We develop a novel, objective, and effective methodology to automate threshold selection and propagate its uncertainty through to high quantile inference. We extend these approaches to handle non-identically distributed data with smooth generalised additive model formulations for the threshold and excess distribution parameters. We adapt the methodology to address requirements for important applications. For coastal flooding, we focus the goodness-of-fit metric on the upper tail to ensure that the selected thresholds lead to accurate fitting to the most extreme observations. For modelling induced earthquakes, we use geophysical covariates regarding the measurement network and stresses induced by gas extraction, to form spatio-temporal threshold and excess distribution parameter functions. We develop a powerful estimator for a key quantity in seismicity modelling, the magnitude of completion. This estimator reduces the uncertainty and provides stronger evidence of a finite upper-endpoint than in previous research. We expand our uncertainty algorithms to account for the unknown model-covariate formulation and incorporate this uncertainty in inference for future endpoint summaries and quantile estimates relevant for design standards. Our methods have much wider applicability for inference for other induced seismicity contexts and wider environmental hazards.

KW - extreme value theory

KW - threshold selection

KW - uncertainty quantification

KW - induced seismicity

KW - coastal flooding

KW - generalised Pareto distribution

U2 - 10.17635/lancaster/thesis/2909

DO - 10.17635/lancaster/thesis/2909

M3 - Doctoral Thesis

PB - Lancaster University

ER -