Available under license: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Exploratory functional data analysis
AU - Qu, Z.
AU - Dai, W.
AU - Euan, C.
AU - Sun, Y.
AU - Genton, M.G.
PY - 2024/11/8
Y1 - 2024/11/8
N2 - With the advance of technology, functional data are being recorded more frequently, whether over one-dimensional or multi-dimensional domains. Due to the high dimensionality and complex features of functional data, exploratory data analysis (EDA) faces significant challenges. To meet the demands of practical applications, researchers have developed various EDA tools, including visualization tools, outlier detection techniques, and clustering methods that can handle diverse types of functional data. This paper offers a comprehensive overview of recent procedures for exploratory functional data analysis (EFDA). It begins by introducing fundamental statistical concepts, such as mean and covariance functions, as well as robust statistics such as the median and quantiles in multivariate functional data. Then, the paper reviews popular visualization methods for functional data, such as the rainbow plot and various versions of the functional boxplot, each designed to accommodate different features of functional data. In addition to visualization tools, the paper also reviews outlier detection methods, which are commonly integrated with visualization methods to identify anomalous patterns within the data. Finally, the paper focuses on functional data clustering techniques, which provide another set of practical tools for EFDA. The paper concludes with a brief discussion of future directions for EFDA. All the reviewed methods have been implemented in an R package named EFDA.
KW - 62A09
KW - 62R10
KW - Clustering
KW - Data visualization
KW - Exploratory data analysis
KW - Functional boxplot
KW - Multivariate functional data
KW - Outlier detection
U2 - 10.1007/s11749-024-00952-8
DO - 10.1007/s11749-024-00952-8
M3 - Journal article
JO - Test
JF - Test
SN - 1133-0686
ER -