Home > Research > Publications & Outputs > Optimal index rules for single resource allocat...

Electronic data

  • gittins3_sig-alternate

    Rights statement: © ACM, 2011.This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools http://doi.acm.org/10.4108/icst.valuetools.2011.245797

    Accepted author manuscript, 277 KB, PDF document

Links

Text available via DOI:

View graph of relations

Optimal index rules for single resource allocation to stochastic dynamic competitors

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
Publication date2011
Host publicationVALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools
Place of PublicationNew York
PublisherACM
Pages425-433
Number of pages9
ISBN (print)978193968091
<mark>Original language</mark>English
Event5th International ICST Conference on Performance Evaluation Methodologies and Tools - Paris, France, United Kingdom
Duration: 16/05/201120/05/2011

Conference

Conference5th International ICST Conference on Performance Evaluation Methodologies and Tools
Country/TerritoryUnited Kingdom
Period16/05/1120/05/11

Conference

Conference5th International ICST Conference on Performance Evaluation Methodologies and Tools
Country/TerritoryUnited Kingdom
Period16/05/1120/05/11

Abstract

In this paper we present a generic Markov decision process model of optimal single resource allocation to a collection of stochastic dynamic competitors. The main goal is to identify sufficient conditions under which this problem is optimally solved by an index rule. The main focus is on the frozen-if-not-allocated assumption, which is notoriously found in problems including the multi-armed bandit problem, tax problem, Klimov network, job sequencing, object search and detection. The problem is approached by a Lagrangian relaxation and decomposed into a collection of normalized parametric single-competitor subproblems, which are then optimally solved by the well-known Gittins index. We show that the problem is equivalent to solving a time sequence of its Lagrangian relaxations. We further show that our approach gives insights on sufficient conditions for optimality of index rules in restless problems (in which the frozen-if-not-allocated assumption is dropped) with single resource; this paper is the first to prove such conditions.

Bibliographic note

© ACM, 2011.This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VALUETOOLS '11 Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools http://doi.acm.org/10.4108/icst.valuetools.2011.245797