Rights statement: This is the author’s version of a work that was accepted for publication in Spatial Statistics. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Spatial Statistics, 50, 100615, 2022 DOI: 10.1016/j.spasta.2022.100615
Accepted author manuscript, 1.2 MB, PDF document
Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Adaptive smoothing to identify spatial structure in global lake ecological processes using satellite remote sensing data
AU - Gong, M.
AU - O'Donnell, R.
AU - Miller, C.
AU - Scott, M.
AU - Simis, S.
AU - Groom, S.
AU - Tyler, A.
AU - Hunter, P.
AU - Spyrakos, E.
AU - Merchant, C.
AU - Maberly, S.
AU - Carvalho, L.
N1 - This is the author’s version of a work that was accepted for publication in Spatial Statistics. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Spatial Statistics, 50, 100615, 2022 DOI: 10.1016/j.spasta.2022.100615
PY - 2022/8/31
Y1 - 2022/8/31
N2 - Satellite remote sensing data are important to the study of environment problems at a global scale. The GloboLakes project aimed to use satellite remote sensing data to investigate the response of the major lakes on Earth to environmental conditions and change. The main challenge to statistical modelling is the identification of the spatial structure in global lake ecological processes from a large number of time series subject to incomplete data and varying uncertainty. This paper introduces a comprehensive modelling procedure, combining adaptive smoothing and functional data analysis, to estimate the smooth curves representing the trend and seasonal patterns in the time series and to cluster the curves over space. Two approaches, based on an irregular basis and an adaptive penalty matrix, are developed to account for the varying uncertainty induced by missing observations and specific constraints (e.g. substantive periods of measurement values of zero in winter). In particular, the adaptive penalty matrix applies a heavier penalty to smooth curve estimates where there is higher uncertainty to prevent over-fitting the noisy/biased data. The modelling procedure was applied to the lake surface water temperature (LSWT) time series from 732 largest lakes globally and the lake chlorophyll-a time series from 535 largest lakes globally. The procedure enabled the identification of nine global lake thermal regions based on the temporal dynamics of LSWT, and the extraction of eight global lake clusters based on the interannual variation in chlorophyll-a and ten clusters to differentiate the seasonal signals.
AB - Satellite remote sensing data are important to the study of environment problems at a global scale. The GloboLakes project aimed to use satellite remote sensing data to investigate the response of the major lakes on Earth to environmental conditions and change. The main challenge to statistical modelling is the identification of the spatial structure in global lake ecological processes from a large number of time series subject to incomplete data and varying uncertainty. This paper introduces a comprehensive modelling procedure, combining adaptive smoothing and functional data analysis, to estimate the smooth curves representing the trend and seasonal patterns in the time series and to cluster the curves over space. Two approaches, based on an irregular basis and an adaptive penalty matrix, are developed to account for the varying uncertainty induced by missing observations and specific constraints (e.g. substantive periods of measurement values of zero in winter). In particular, the adaptive penalty matrix applies a heavier penalty to smooth curve estimates where there is higher uncertainty to prevent over-fitting the noisy/biased data. The modelling procedure was applied to the lake surface water temperature (LSWT) time series from 732 largest lakes globally and the lake chlorophyll-a time series from 535 largest lakes globally. The procedure enabled the identification of nine global lake thermal regions based on the temporal dynamics of LSWT, and the extraction of eight global lake clusters based on the interannual variation in chlorophyll-a and ten clusters to differentiate the seasonal signals.
KW - Adaptive smoothing
KW - Functional data analysis
KW - Satellite remote sensing data
KW - Spatial structure
U2 - 10.1016/j.spasta.2022.100615
DO - 10.1016/j.spasta.2022.100615
M3 - Journal article
VL - 50
JO - Spatial Statistics
JF - Spatial Statistics
SN - 2211-6753
M1 - 100615
ER -