Sampling properties and empirical estimates of extreme events

School of Mathematical Sciences

Text available via DOI:

https://doi.org/10.1016/j.oceaneng.2021.109791
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

Confidence interval, Empirical distribution function, Model diagnostics, Plotting position, Return period, Return value, Sampling variability

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Sampling properties and empirical estimates of extreme events. / Mackay, E.; Jonathan, P.
In: Ocean Engineering, Vol. 239, 109791, 01.11.2021.

Harvard

Mackay, E & Jonathan, P 2021, 'Sampling properties and empirical estimates of extreme events', Ocean Engineering, vol. 239, 109791. https://doi.org/10.1016/j.oceaneng.2021.109791

APA

Mackay, E., & Jonathan, P. (2021). Sampling properties and empirical estimates of extreme events. Ocean Engineering, 239, Article 109791. https://doi.org/10.1016/j.oceaneng.2021.109791

Vancouver

Mackay E, Jonathan P. Sampling properties and empirical estimates of extreme events. Ocean Engineering. 2021 Nov 1;239:109791. Epub 2021 Sept 15. doi: 10.1016/j.oceaneng.2021.109791

Author

Mackay, E. ; Jonathan, P. / Sampling properties and empirical estimates of extreme events. In: Ocean Engineering. 2021 ; Vol. 239.

Bibtex

@article{92e1aafa7efb42f38a545c6273aeec37,
  title = "Sampling properties and empirical estimates of extreme events",
  abstract = "The statistical characteristics of the largest observations in a sample are highly uncertain. In this work we consider the problem of how to define empirical estimates of exceedance probabilities and return periods associated with an ordered sample of observations. Understanding the sampling properties of these quantities is important for assessing the fit of a statistical model and also for placing confidence bounds on estimates of extreme events from Monte Carlo simulations. The empirical distribution function (EDF) is often defined as the expected non-exceedance probability (NEP) associated with sample order statistics. Yet, due to the non-linearity of the relations between return periods, quantiles and NEP, the return period (or quantile) associated with the expected NEP is not equal to the expected return period (or quantile), leading to ambiguity. However, the sampling distributions of exceedance probabilities, return periods and quantiles are, in fact, linked by a simple relation. From this relation, it follows that defining the EDF in terms of the median NEP of the order statistics gives a consistent framework for defining empirical estimates of all three quantities. We demonstrate that the median value of the return period of the largest observation is 44% larger than the return period calculated using the common definition of the EDF in terms of the expected NEP of the order statistics. We also derive some new results about the size of the confidence intervals for exceedance probabilities and return periods.",
  keywords = "Confidence interval, Empirical distribution function, Model diagnostics, Plotting position, Return period, Return value, Sampling variability",
  author = "E. Mackay and P. Jonathan",
  year = "2021",
  month = nov,
  day = "1",
  doi = "10.1016/j.oceaneng.2021.109791",
  language = "English",
  volume = "239",
  journal = "Ocean Engineering",
  issn = "0029-8018",
  publisher = "Elsevier Ltd",
}
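
The 44% figure quoted in the abstract can be checked with a short calculation. The following is a minimal sketch, not the authors' code. It assumes independent, identically distributed observations from a continuous distribution, with return periods measured in numbers of observations. Under those assumptions the NEP of the largest of n observations follows a Beta(n, 1) distribution, whose mean is n/(n+1) and whose median is 0.5^(1/n). The function names are illustrative only.

```python
import math


def return_period_from_expected_nep(n: int) -> float:
    """Return period of the sample maximum using the expected NEP n/(n+1).

    Assumption: return period T = 1 / (1 - NEP), in units of observations,
    so T = 1 / (1 - n/(n+1)) = n + 1.
    """
    return float(n + 1)


def return_period_from_median_nep(n: int) -> float:
    """Return period of the sample maximum using the median NEP 0.5**(1/n).

    1 - 0.5**(1/n) is evaluated as -expm1(-ln(2)/n) to avoid loss of
    precision when n is large.
    """
    exceedance_probability = -math.expm1(-math.log(2.0) / n)
    return 1.0 / exceedance_probability


def median_to_expected_ratio(n: int) -> float:
    """Ratio of the median-NEP return period to the expected-NEP one."""
    return return_period_from_median_nep(n) / return_period_from_expected_nep(n)


if __name__ == "__main__":
    for n in (10, 100, 1000, 10**6):
        print(n, median_to_expected_ratio(n))
```

For large n the ratio tends to 1/ln 2, approximately 1.443, which is consistent with the 44% quoted in the abstract. For small samples the ratio is smaller, and it equals 1 when n = 1.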

RIS

TY - JOUR
T1 - Sampling properties and empirical estimates of extreme events
AU - Mackay, E.
AU - Jonathan, P.
PY - 2021/11/1
Y1 - 2021/11/1
N2 - The statistical characteristics of the largest observations in a sample are highly uncertain. In this work we consider the problem of how to define empirical estimates of exceedance probabilities and return periods associated with an ordered sample of observations. Understanding the sampling properties of these quantities is important for assessing the fit of a statistical model and also for placing confidence bounds on estimates of extreme events from Monte Carlo simulations. The empirical distribution function (EDF) is often defined as the expected non-exceedance probability (NEP) associated with sample order statistics. Yet, due to the non-linearity of the relations between return periods, quantiles and NEP, the return period (or quantile) associated with the expected NEP is not equal to the expected return period (or quantile), leading to ambiguity. However, the sampling distributions of exceedance probabilities, return periods and quantiles are, in fact, linked by a simple relation. From this relation, it follows that defining the EDF in terms of the median NEP of the order statistics gives a consistent framework for defining empirical estimates of all three quantities. We demonstrate that the median value of the return period of the largest observation is 44% larger than the return period calculated using the common definition of the EDF in terms of the expected NEP of the order statistics. We also derive some new results about the size of the confidence intervals for exceedance probabilities and return periods.
KW - Confidence interval
KW - Empirical distribution function
KW - Model diagnostics
KW - Plotting position
KW - Return period
KW - Return value
KW - Sampling variability
U2 - 10.1016/j.oceaneng.2021.109791
DO - 10.1016/j.oceaneng.2021.109791
M3 - Journal article
VL - 239
JO - Ocean Engineering
JF - Ocean Engineering
SN - 0029-8018
M1 - 109791
ER -
