A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem

School Of Mathematical Sciences

Electronic data

Polling_LA_IEEE_Papadimitriou
Accepted author manuscript, 813 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.1109/TSMC.2024.3414832
Final published version

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

E-pub ahead of print

Standard

A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem. / Yazidi, Anis; Hammer, Hugo; Leslie, David.
In: IEEE Transactions on Systems, Man, and Cybernetics: Systems, 17.09.2024.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Yazidi, A, Hammer, H & Leslie, D 2024, 'A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem', IEEE Transactions on Systems, Man, and Cybernetics: Systems. https://doi.org/10.1109/TSMC.2024.3414832

APA

Yazidi, A., Hammer, H., & Leslie, D. (2024). A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem. IEEE Transactions on Systems, Man, and Cybernetics: Systems. Advance online publication. https://doi.org/10.1109/TSMC.2024.3414832

Vancouver

Yazidi A, Hammer H, Leslie D. A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2024 Sept 17. Epub 2024 Sept 17. doi: 10.1109/TSMC.2024.3414832

Author

Yazidi, Anis ; Hammer, Hugo ; Leslie, David. / A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem. In: IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2024.

Bibtex

@article{c088e9d93afa4d178de362ae1571e4e0,

title = "A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem",

abstract = "In this article, we introduce a novel learning automata (LA) solution to the nonlinear stochastic proportional polling (NSPP) problem. The only available solution to this problem in the literature is that given by Nicopolitidis et al. (2003), Obaidat et al. (2002), and Papadimitriou et al. (2002). It was shown to solve a large set of the adaptive resource allocation problems under noisy environments (Nicopolitidis et al., 2003; Obaidat et al., 2002; Papadimitriou and Pomportsis, 2000 and 1999; Nicopolitidis et al., 2004; Obaidat et al., 2001; and Papadimitriou and Pomportsis, 2000). We make a threefold contribution. First, we take a two-timescale approach to the field of LA by estimating the reward probabilities on a faster timescale than the timescale for updating the polling probabilities. Second, by making a not-obvious choice of the objective function, we show that the NSPP problem is indeed an instantiation of the stochastic nonlinear fractional equality knapsack (NFEK) problem, which is a substantial resource allocation problem based on the incomplete and noisy information (Granmo and Oommen, 2010). Third, in contrast to the legacy approach taken by Papadimitriou and Maritsas (1992 and 1996), we show through the extensive experimental results that our solution is remarkably robust to the choice of tuning parameters and that it outperforms the state of the art solution in terms of the Bayesian expected loss.",

author = "Anis Yazidi and Hugo Hammer and David Leslie",

year = "2024",

month = sep,

day = "17",

doi = "10.1109/TSMC.2024.3414832",

language = "English",

journal = "IEEE Transactions on Systems, Man, and Cybernetics: Systems",

issn = "2168-2216",

publisher = "IEEE Advancing Technology for Humanity",

}

RIS

TY - JOUR

T1 - A Two-Timescale Learning Automata Solution to the Nonlinear Stochastic Proportional Polling Problem

AU - Yazidi, Anis

AU - Hammer, Hugo

AU - Leslie, David

PY - 2024/9/17

Y1 - 2024/9/17

N2 - In this article, we introduce a novel learning automata (LA) solution to the nonlinear stochastic proportional polling (NSPP) problem. The only available solution to this problem in the literature is that given by Nicopolitidis et al. (2003), Obaidat et al. (2002), and Papadimitriou et al. (2002). It was shown to solve a large set of the adaptive resource allocation problems under noisy environments (Nicopolitidis et al., 2003; Obaidat et al., 2002; Papadimitriou and Pomportsis, 2000 and 1999; Nicopolitidis et al., 2004; Obaidat et al., 2001; and Papadimitriou and Pomportsis, 2000). We make a threefold contribution. First, we take a two-timescale approach to the field of LA by estimating the reward probabilities on a faster timescale than the timescale for updating the polling probabilities. Second, by making a not-obvious choice of the objective function, we show that the NSPP problem is indeed an instantiation of the stochastic nonlinear fractional equality knapsack (NFEK) problem, which is a substantial resource allocation problem based on the incomplete and noisy information (Granmo and Oommen, 2010). Third, in contrast to the legacy approach taken by Papadimitriou and Maritsas (1992 and 1996), we show through the extensive experimental results that our solution is remarkably robust to the choice of tuning parameters and that it outperforms the state of the art solution in terms of the Bayesian expected loss.

AB - In this article, we introduce a novel learning automata (LA) solution to the nonlinear stochastic proportional polling (NSPP) problem. The only available solution to this problem in the literature is that given by Nicopolitidis et al. (2003), Obaidat et al. (2002), and Papadimitriou et al. (2002). It was shown to solve a large set of the adaptive resource allocation problems under noisy environments (Nicopolitidis et al., 2003; Obaidat et al., 2002; Papadimitriou and Pomportsis, 2000 and 1999; Nicopolitidis et al., 2004; Obaidat et al., 2001; and Papadimitriou and Pomportsis, 2000). We make a threefold contribution. First, we take a two-timescale approach to the field of LA by estimating the reward probabilities on a faster timescale than the timescale for updating the polling probabilities. Second, by making a not-obvious choice of the objective function, we show that the NSPP problem is indeed an instantiation of the stochastic nonlinear fractional equality knapsack (NFEK) problem, which is a substantial resource allocation problem based on the incomplete and noisy information (Granmo and Oommen, 2010). Third, in contrast to the legacy approach taken by Papadimitriou and Maritsas (1992 and 1996), we show through the extensive experimental results that our solution is remarkably robust to the choice of tuning parameters and that it outperforms the state of the art solution in terms of the Bayesian expected loss.

U2 - 10.1109/TSMC.2024.3414832

DO - 10.1109/TSMC.2024.3414832

M3 - Journal article

JO - IEEE Transactions on Systems, Man, and Cybernetics: Systems

JF - IEEE Transactions on Systems, Man, and Cybernetics: Systems

SN - 2168-2216

ER -

Research

Electronic data

Links

Text available via DOI: