Input Variable Selection for Time Series Forecasting with Artificial Neural Networks: An Empirical Evaluation across Varying Time Series Frenquencies.

Lancaster University

Electronic data

11003735.pdf
Final published version, 9.1 MB, PDF document
Available under license: CC BY-ND

Keywords

MiAaPQ, Economics., Business administration.

View graph of relations

Research output: Thesis › Doctoral Thesis

Unpublished

Standard

Input Variable Selection for Time Series Forecasting with Artificial Neural Networks: An Empirical Evaluation across Varying Time Series Frenquencies. / Kourentzes, Nikolaos.
Lancaster: Lancaster University, 2009. 239 p.

Research output: Thesis › Doctoral Thesis

Bibtex

@phdthesis{7e27f80bbfb04d56ae41540ad766f855,

title = "Input Variable Selection for Time Series Forecasting with Artificial Neural Networks: An Empirical Evaluation across Varying Time Series Frenquencies.",

abstract = "Over the last two decades there has been an increase in the research of artificial neural networks (ANNs) to forecasting problems. Both in theoretical and empirical works, ANNs have shown evidence of good performance, in many cases outperforming established statistical benchmarks. This thesis starts by reviewing the advances in ANNs for time series forecasting, assessing their performance in the literature, analysing the current state of the art, the modelling issues that have been solved and which are still critical for forecasting with ANNs, thereby indicating future research directions. The specification of the input vector is identified as the most crucial unresolved modelling issue for ANNs' accuracy. Notably, there is no rigorous empirical evaluation of the multiple published input variable selection methodologies. This problem is addressed from four different perspectives. A rigorous evaluation of several published methodologies, along with new proposed variations, is performed on low frequency data, exploring which input variable selection methodologies perform best. This analysis concludes that regression based methodologies outperformed other linear and nonlinear ones. The best way to code deterministic seasonality in the inputs of the ANNs is explored, a topic overlooked in the ANN literature, and a parsimonious encoding based on seasonal indices is proposed. The effect of the frequency of the time series on specifying the inputs for ANNs for forecasting is evaluated, revealing several challenges in modelling high frequency time series and providing evidence that the performance of several input variable specification methodologies is not consistent for different data frequencies. This leads to an evaluation of methodologies to select input variables for ANNs solely for high frequency data. Regression based methodologies are found to perform best, in agreement with the evaluation on low frequency dataset, while the ranking of the remaining methodologies is found to be inconsistent for different data frequencies.",

keywords = "MiAaPQ, Economics., Business administration.",

author = "Nikolaos Kourentzes",

note = "Thesis (Ph.D.)--Lancaster University (United Kingdom), 2009.",

year = "2009",

language = "English",

publisher = "Lancaster University",

school = "Lancaster University",

}

RIS

TY - BOOK

T1 - Input Variable Selection for Time Series Forecasting with Artificial Neural Networks: An Empirical Evaluation across Varying Time Series Frenquencies.

AU - Kourentzes, Nikolaos

N1 - Thesis (Ph.D.)--Lancaster University (United Kingdom), 2009.

PY - 2009

Y1 - 2009

N2 - Over the last two decades there has been an increase in the research of artificial neural networks (ANNs) to forecasting problems. Both in theoretical and empirical works, ANNs have shown evidence of good performance, in many cases outperforming established statistical benchmarks. This thesis starts by reviewing the advances in ANNs for time series forecasting, assessing their performance in the literature, analysing the current state of the art, the modelling issues that have been solved and which are still critical for forecasting with ANNs, thereby indicating future research directions. The specification of the input vector is identified as the most crucial unresolved modelling issue for ANNs' accuracy. Notably, there is no rigorous empirical evaluation of the multiple published input variable selection methodologies. This problem is addressed from four different perspectives. A rigorous evaluation of several published methodologies, along with new proposed variations, is performed on low frequency data, exploring which input variable selection methodologies perform best. This analysis concludes that regression based methodologies outperformed other linear and nonlinear ones. The best way to code deterministic seasonality in the inputs of the ANNs is explored, a topic overlooked in the ANN literature, and a parsimonious encoding based on seasonal indices is proposed. The effect of the frequency of the time series on specifying the inputs for ANNs for forecasting is evaluated, revealing several challenges in modelling high frequency time series and providing evidence that the performance of several input variable specification methodologies is not consistent for different data frequencies. This leads to an evaluation of methodologies to select input variables for ANNs solely for high frequency data. Regression based methodologies are found to perform best, in agreement with the evaluation on low frequency dataset, while the ranking of the remaining methodologies is found to be inconsistent for different data frequencies.

AB - Over the last two decades there has been an increase in the research of artificial neural networks (ANNs) to forecasting problems. Both in theoretical and empirical works, ANNs have shown evidence of good performance, in many cases outperforming established statistical benchmarks. This thesis starts by reviewing the advances in ANNs for time series forecasting, assessing their performance in the literature, analysing the current state of the art, the modelling issues that have been solved and which are still critical for forecasting with ANNs, thereby indicating future research directions. The specification of the input vector is identified as the most crucial unresolved modelling issue for ANNs' accuracy. Notably, there is no rigorous empirical evaluation of the multiple published input variable selection methodologies. This problem is addressed from four different perspectives. A rigorous evaluation of several published methodologies, along with new proposed variations, is performed on low frequency data, exploring which input variable selection methodologies perform best. This analysis concludes that regression based methodologies outperformed other linear and nonlinear ones. The best way to code deterministic seasonality in the inputs of the ANNs is explored, a topic overlooked in the ANN literature, and a parsimonious encoding based on seasonal indices is proposed. The effect of the frequency of the time series on specifying the inputs for ANNs for forecasting is evaluated, revealing several challenges in modelling high frequency time series and providing evidence that the performance of several input variable specification methodologies is not consistent for different data frequencies. This leads to an evaluation of methodologies to select input variables for ANNs solely for high frequency data. Regression based methodologies are found to perform best, in agreement with the evaluation on low frequency dataset, while the ranking of the remaining methodologies is found to be inconsistent for different data frequencies.

KW - MiAaPQ

KW - Economics.

KW - Business administration.

M3 - Doctoral Thesis

PB - Lancaster University

CY - Lancaster

ER -

Research

Electronic data

Keywords