Home > Research > Publications & Outputs > Demand forecasting with high dimensional data

Electronic data

  • Ma_Forecasting_with_high_dimensional_data_EJOR_2016_final

    Rights statement: This is the author’s version of a work that was accepted for publication in European Journal of Operational Research. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in European Journal of Operational Research, 249, 1, 2015 DOI: 10.1016/j.eor.2015.08.029

    Accepted author manuscript, 1.63 MB, PDF document

    Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Links

Text available via DOI:

View graph of relations

Demand forecasting with high dimensional data: the case of SKU retail sales forecasting with intra- and inter-category promotional information

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Demand forecasting with high dimensional data: the case of SKU retail sales forecasting with intra- and inter-category promotional information. / Ma, Shaohui; Fildes, Robert Alan; Huang, Tao.
In: European Journal of Operational Research, Vol. 249, No. 1, 16.02.2016, p. 245-257.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Ma S, Fildes RA, Huang T. Demand forecasting with high dimensional data: the case of SKU retail sales forecasting with intra- and inter-category promotional information. European Journal of Operational Research. 2016 Feb 16;249(1):245-257. Epub 2015 Aug 28. doi: 10.1016/j.ejor.2015.08.029

Author

Bibtex

@article{fce38ab20cdc4876afd2a10e946fbe02,
title = "Demand forecasting with high dimensional data: the case of SKU retail sales forecasting with intra- and inter-category promotional information",
abstract = "In marketing analytics applications in OR, the modeler often faces the problem of selecting key variables from a large number of possibilities. For example, SKU level retail store sales are affected by inter and intra category effects which potentially need to be considered when deciding on promotional strategy and producing operational forecasts. But no research has yet put this well accepted concept into forecasting practice: an obvious obstacle is the ultra-high dimensionality of the variable space. This paper develops a four steps methodological framework to overcome the problem. It is illustrated by investigating the value of both intra- and inter-category SKU level promotional information in improving forecast accuracy. The method consists of the identification of potentially influential categories, the building of the explanatory variable space, variable selection and model estimation by a multistage LASSO regression, and the use of a rolling scheme to generate forecasts. The success of this new method for dealing with high dimensionality is demonstrated by improvements in forecasting accuracy compared to alternative methods of simplifying the variable space. The empirical results show that models integrating more information perform significantly better than the baseline model when using the proposed methodology framework. In general, we can improve the forecasting accuracy by 12.6 percent over the model using only the SKU's own predictors. But of the improvements achieved, 95 percent of it comes from the intra-category information, and only 5 percent from the inter-category information. The substantive marketing results also have implications for promotional category management.",
keywords = "Analytics, OR in marketing, Forecasting, Retailing, Promotions",
author = "Shaohui Ma and Fildes, {Robert Alan} and Tao Huang",
note = "This is the author{\textquoteright}s version of a work that was accepted for publication in European Journal of Operational Research. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in European Journal of Operational Research, 249, 1, 2015 DOI: 10.1016/j.eor.2015.08.029",
year = "2016",
month = feb,
day = "16",
doi = "10.1016/j.ejor.2015.08.029",
language = "English",
volume = "249",
pages = "245--257",
journal = "European Journal of Operational Research",
issn = "0377-2217",
publisher = "Elsevier Science B.V.",
number = "1",

}

RIS

TY - JOUR

T1 - Demand forecasting with high dimensional data

T2 - the case of SKU retail sales forecasting with intra- and inter-category promotional information

AU - Ma, Shaohui

AU - Fildes, Robert Alan

AU - Huang, Tao

N1 - This is the author’s version of a work that was accepted for publication in European Journal of Operational Research. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in European Journal of Operational Research, 249, 1, 2015 DOI: 10.1016/j.eor.2015.08.029

PY - 2016/2/16

Y1 - 2016/2/16

N2 - In marketing analytics applications in OR, the modeler often faces the problem of selecting key variables from a large number of possibilities. For example, SKU level retail store sales are affected by inter and intra category effects which potentially need to be considered when deciding on promotional strategy and producing operational forecasts. But no research has yet put this well accepted concept into forecasting practice: an obvious obstacle is the ultra-high dimensionality of the variable space. This paper develops a four steps methodological framework to overcome the problem. It is illustrated by investigating the value of both intra- and inter-category SKU level promotional information in improving forecast accuracy. The method consists of the identification of potentially influential categories, the building of the explanatory variable space, variable selection and model estimation by a multistage LASSO regression, and the use of a rolling scheme to generate forecasts. The success of this new method for dealing with high dimensionality is demonstrated by improvements in forecasting accuracy compared to alternative methods of simplifying the variable space. The empirical results show that models integrating more information perform significantly better than the baseline model when using the proposed methodology framework. In general, we can improve the forecasting accuracy by 12.6 percent over the model using only the SKU's own predictors. But of the improvements achieved, 95 percent of it comes from the intra-category information, and only 5 percent from the inter-category information. The substantive marketing results also have implications for promotional category management.

AB - In marketing analytics applications in OR, the modeler often faces the problem of selecting key variables from a large number of possibilities. For example, SKU level retail store sales are affected by inter and intra category effects which potentially need to be considered when deciding on promotional strategy and producing operational forecasts. But no research has yet put this well accepted concept into forecasting practice: an obvious obstacle is the ultra-high dimensionality of the variable space. This paper develops a four steps methodological framework to overcome the problem. It is illustrated by investigating the value of both intra- and inter-category SKU level promotional information in improving forecast accuracy. The method consists of the identification of potentially influential categories, the building of the explanatory variable space, variable selection and model estimation by a multistage LASSO regression, and the use of a rolling scheme to generate forecasts. The success of this new method for dealing with high dimensionality is demonstrated by improvements in forecasting accuracy compared to alternative methods of simplifying the variable space. The empirical results show that models integrating more information perform significantly better than the baseline model when using the proposed methodology framework. In general, we can improve the forecasting accuracy by 12.6 percent over the model using only the SKU's own predictors. But of the improvements achieved, 95 percent of it comes from the intra-category information, and only 5 percent from the inter-category information. The substantive marketing results also have implications for promotional category management.

KW - Analytics

KW - OR in marketing

KW - Forecasting

KW - Retailing

KW - Promotions

U2 - 10.1016/j.ejor.2015.08.029

DO - 10.1016/j.ejor.2015.08.029

M3 - Journal article

VL - 249

SP - 245

EP - 257

JO - European Journal of Operational Research

JF - European Journal of Operational Research

SN - 0377-2217

IS - 1

ER -