Filtered Poisson process bandit on a continuum

Home > Research > Publications & Outputs > Filtered Poisson process bandit on a continuum

School Of Mathematical Sciences

Associated organisational unit

Statistical Artificial Intelligence

Electronic data

FPPBanditEJOR-9
Accepted author manuscript, 4.27 MB, PDF document
Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Text available via DOI:

https://doi.org/10.1016/j.ejor.2021.03.033
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

Applied probability, Poisson processes, Multi-armed bandit, Machine learning

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

More...

<mark>Journal publication date</mark>	31/12/2021
<mark>Journal</mark>	European Journal of Operational Research
Issue number	2
Volume	295
Number of pages	12
Pages (from-to)	575-586
Publication Status	Published
Early online date	24/03/21
<mark>Original language</mark>	English

Abstract

We consider a version of the continuum armed bandit where an action induces a filtered realisation of a non-homogeneous Poisson process. Point data in the filtered sample are then revealed to the decision-maker, whose reward is the total number of revealed points. Using knowledge of the function governing the filtering, but without knowledge of the Poisson intensity function, the decision-maker seeks to maximise the expected number of revealed points over T rounds. We propose an upper confidence bound algorithm for this problem utilising data-adaptive discretisation of the action space. This approach enjoys \tilde{O}(T^(2/3)) regret under a Lipschitz assumption on the reward function. We provide lower bounds on the regret of any algorithm for the problem, via new lower bounds for related finite-armed bandits, and show that the orders of the upper and lower bounds match up to a logarithmic factor.

Research

Associated organisational unit

Electronic data

Links

Text available via DOI:

Keywords

Filtered Poisson process bandit on a continuum

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us