Optimal index rules for single resource allocation to stochastic dynamic competitors

Management Science

Associated organisational unit

STOR-i Centre for Doctoral Training

Electronic data

gittins3_sig-alternate
Rights statement: © ACM, 2011.This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools http://doi.acm.org/10.4108/icst.valuetools.2011.245797
Accepted author manuscript, 277 KB, PDF document

Text available via DOI:

https://doi.org/10.4108/icst.valuetools.2011.245797
Final published version

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Peter Jacko

More...

Publication date	2011
Host publication	VALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools
Place of Publication	New York
Publisher	ACM
Pages	425-433
Number of pages	9
ISBN (print)	978193968091
<mark>Original language</mark>	English
Event	5th International ICST Conference on Performance Evaluation Methodologies and Tools - Paris, France, United Kingdom Duration: 16/05/2011 → 20/05/2011

Conference

Conference	5th International ICST Conference on Performance Evaluation Methodologies and Tools
Country/Territory	United Kingdom
Period	16/05/11 → 20/05/11

Conference

Conference	5th International ICST Conference on Performance Evaluation Methodologies and Tools
Country/Territory	United Kingdom
Period	16/05/11 → 20/05/11

Abstract

In this paper we present a generic Markov decision process model of optimal single resource allocation to a collection of stochastic dynamic competitors. The main goal is to identify sufficient conditions under which this problem is optimally solved by an index rule. The main focus is on the frozen-if-not-allocated assumption, which is notoriously found in problems including the multi-armed bandit problem, tax problem, Klimov network, job sequencing, object search and detection. The problem is approached by a Lagrangian relaxation and decomposed into a collection of normalized parametric single-competitor subproblems, which are then optimally solved by the well-known Gittins index. We show that the problem is equivalent to solving a time sequence of its Lagrangian relaxations. We further show that our approach gives insights on sufficient conditions for optimality of index rules in restless problems (in which the frozen-if-not-allocated assumption is dropped) with single resource; this paper is the first to prove such conditions.

Bibliographic note

© ACM, 2011.This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools http://doi.acm.org/10.4108/icst.valuetools.2011.245797

Research

Associated organisational unit

Electronic data

Links

Text available via DOI: