Home > Research > Publications & Outputs > A simulation-based approximate dynamic programm...

Associated organisational unit

Links

Text available via DOI:

View graph of relations

A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem

Research output: Contribution to Journal/MagazineJournal articlepeer-review

E-pub ahead of print

Standard

A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem. / Satic, U.; Jacko, P.; Kirkbride, C.
In: European Journal of Operational Research, Vol. 315, No. 2, 01.06.2024, p. 454-469.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Satic U, Jacko P, Kirkbride C. A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem. European Journal of Operational Research. 2024 Jun 1;315(2):454-469. Epub 2024 Feb 13. doi: 10.1016/j.ejor.2023.10.046

Author

Bibtex

@article{27ecb6af9d9c41eaa8fd08e2bec586b3,
title = "A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem",
abstract = "We consider the dynamic and stochastic resource-constrained multi-project scheduling problem which allows for the random arrival of projects and stochastic task durations. Completing projects generates rewards, which are reduced by a tardiness cost in the case of late completion. Multiple types of resource are available, and projects consume different amounts of these resources when under processing. The problem is modelled as an infinite-horizon discrete-time Markov decision process and seeks to maximise the expected discounted long-run profit. We use an approximate dynamic programming algorithm (ADP) with a linear approximation model which can be used for online decision making. Our approximation model uses project elements that are easily accessible by a decision-maker, with the model coefficients obtained offline via a combination of Monte Carlo simulation and least squares estimation. Our numerical study shows that ADP often statistically significantly outperforms the optimal reactive baseline algorithm (ORBA). In experiments on smaller problems however, both typically perform suboptimally compared to the optimal scheduler obtained by stochastic dynamic programming. ADP has an advantage over ORBA and dynamic programming in that ADP can be applied to larger problems. We also show that ADP generally produces statistically significantly higher profits than common algorithms used in practice, such as a rule-based algorithm and a reactive genetic algorithm.",
keywords = "Information Systems and Management, Management Science and Operations Research, Modeling and Simulation, General Computer Science, Industrial and Manufacturing Engineering",
author = "U. Satic and P. Jacko and C. Kirkbride",
year = "2024",
month = feb,
day = "13",
doi = "10.1016/j.ejor.2023.10.046",
language = "English",
volume = "315",
pages = "454--469",
journal = "European Journal of Operational Research",
issn = "0377-2217",
publisher = "Elsevier Science B.V.",
number = "2",

}

RIS

TY - JOUR

T1 - A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem

AU - Satic, U.

AU - Jacko, P.

AU - Kirkbride, C.

PY - 2024/2/13

Y1 - 2024/2/13

N2 - We consider the dynamic and stochastic resource-constrained multi-project scheduling problem which allows for the random arrival of projects and stochastic task durations. Completing projects generates rewards, which are reduced by a tardiness cost in the case of late completion. Multiple types of resource are available, and projects consume different amounts of these resources when under processing. The problem is modelled as an infinite-horizon discrete-time Markov decision process and seeks to maximise the expected discounted long-run profit. We use an approximate dynamic programming algorithm (ADP) with a linear approximation model which can be used for online decision making. Our approximation model uses project elements that are easily accessible by a decision-maker, with the model coefficients obtained offline via a combination of Monte Carlo simulation and least squares estimation. Our numerical study shows that ADP often statistically significantly outperforms the optimal reactive baseline algorithm (ORBA). In experiments on smaller problems however, both typically perform suboptimally compared to the optimal scheduler obtained by stochastic dynamic programming. ADP has an advantage over ORBA and dynamic programming in that ADP can be applied to larger problems. We also show that ADP generally produces statistically significantly higher profits than common algorithms used in practice, such as a rule-based algorithm and a reactive genetic algorithm.

AB - We consider the dynamic and stochastic resource-constrained multi-project scheduling problem which allows for the random arrival of projects and stochastic task durations. Completing projects generates rewards, which are reduced by a tardiness cost in the case of late completion. Multiple types of resource are available, and projects consume different amounts of these resources when under processing. The problem is modelled as an infinite-horizon discrete-time Markov decision process and seeks to maximise the expected discounted long-run profit. We use an approximate dynamic programming algorithm (ADP) with a linear approximation model which can be used for online decision making. Our approximation model uses project elements that are easily accessible by a decision-maker, with the model coefficients obtained offline via a combination of Monte Carlo simulation and least squares estimation. Our numerical study shows that ADP often statistically significantly outperforms the optimal reactive baseline algorithm (ORBA). In experiments on smaller problems however, both typically perform suboptimally compared to the optimal scheduler obtained by stochastic dynamic programming. ADP has an advantage over ORBA and dynamic programming in that ADP can be applied to larger problems. We also show that ADP generally produces statistically significantly higher profits than common algorithms used in practice, such as a rule-based algorithm and a reactive genetic algorithm.

KW - Information Systems and Management

KW - Management Science and Operations Research

KW - Modeling and Simulation

KW - General Computer Science

KW - Industrial and Manufacturing Engineering

U2 - 10.1016/j.ejor.2023.10.046

DO - 10.1016/j.ejor.2023.10.046

M3 - Journal article

VL - 315

SP - 454

EP - 469

JO - European Journal of Operational Research

JF - European Journal of Operational Research

SN - 0377-2217

IS - 2

ER -