Accepted author manuscript, 4.29 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
<mark>Journal publication date</mark> | 8/11/2024 |
---|---|
<mark>Journal</mark> | Test |
Publication Status | E-pub ahead of print |
Early online date | 8/11/24 |
<mark>Original language</mark> | English |
With the advance of technology, functional data are being recorded more frequently, whether over one-dimensional or multi-dimensional domains. Due to the high dimensionality and complex features of functional data, exploratory data analysis (EDA) faces significant challenges. To meet the demands of practical applications, researchers have developed various EDA tools, including visualization tools, outlier detection techniques, and clustering methods that can handle diverse types of functional data. This paper offers a comprehensive overview of recent procedures for exploratory functional data analysis (EFDA). It begins by introducing fundamental statistical concepts, such as mean and covariance functions, as well as robust statistics such as the median and quantiles in multivariate functional data. Then, the paper reviews popular visualization methods for functional data, such as the rainbow plot, and various versions of the functional boxplot, each designed to accommodate different features of functional data. In addition to visualization tools, the paper also reviews outlier detection methods, which are commonly integrated with visualization methods to identify anomalous patterns within the data. Finally, the paper focuses on functional data clustering techniques which provide another set of practical tools for EFDA. The paper concludes with a brief discussion of future directions for EFDA. All the reviewed methods have been implemented in an R package named EFDA.