Home > Research > Publications & Outputs > Stopping Criteria for Value Iteration on Stocha...

Links

Text available via DOI:

View graph of relations

Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives. / Kretínský, Jan; Meggendorfer, Tobias; Weininger, Maximilian.
2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023. IEEE, 2023. p. 1-14 (Proceedings - Symposium on Logic in Computer Science; Vol. 2023-June).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Kretínský, J, Meggendorfer, T & Weininger, M 2023, Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives. in 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023. Proceedings - Symposium on Logic in Computer Science, vol. 2023-June, IEEE, pp. 1-14. https://doi.org/10.1109/LICS56636.2023.10175771

APA

Kretínský, J., Meggendorfer, T., & Weininger, M. (2023). Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives. In 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023 (pp. 1-14). (Proceedings - Symposium on Logic in Computer Science; Vol. 2023-June). IEEE. https://doi.org/10.1109/LICS56636.2023.10175771

Vancouver

Kretínský J, Meggendorfer T, Weininger M. Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives. In 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023. IEEE. 2023. p. 1-14. (Proceedings - Symposium on Logic in Computer Science). doi: 10.1109/LICS56636.2023.10175771

Author

Kretínský, Jan ; Meggendorfer, Tobias ; Weininger, Maximilian. / Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives. 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023. IEEE, 2023. pp. 1-14 (Proceedings - Symposium on Logic in Computer Science).

Bibtex

@inproceedings{e654eebe90a94a3c9ae6b4b102a5520b,
title = "Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives.",
abstract = "A classic solution technique for Markov decision processes (MDP) and stochastic games (SG) is value iteration (VI). Due to its good practical performance, this approximative approach is typically preferred over exact techniques, even though no practical bounds on the imprecision of the result could be given until recently. As a consequence, even the most used model checkers could return arbitrarily wrong results. Over the past decade, different works derived stopping criteria, indicating when the precision reaches the desired level, for various settings, in particular MDP with reachability, total reward, and mean payoff, and SG with reachability.In this paper, we provide the first stopping criteria for VI on SG with total reward and mean payoff, yielding the first anytime algorithms in these settings. To this end, we provide the solution in two flavours: First through a reduction to the MDP case and second directly on SG. The former is simpler and automatically utilizes any advances on MDP. The latter allows for more local computations, heading towards better practical efficiency.Our solution unifies the previously mentioned approaches for MDP and SG and their underlying ideas. To achieve this, we isolate objective-specific subroutines as well as identify objective-independent concepts. These structural concepts, while surprisingly simple, form the very essence of the unified solution.",
keywords = "Stochastic games, value iteration",
author = "Jan Kret{\'i}nsk{\'y} and Tobias Meggendorfer and Maximilian Weininger",
year = "2023",
month = jul,
day = "14",
doi = "10.1109/LICS56636.2023.10175771",
language = "English",
series = "Proceedings - Symposium on Logic in Computer Science",
publisher = "IEEE",
pages = "1--14",
booktitle = "2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023",

}

RIS

TY - GEN

T1 - Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives.

AU - Kretínský, Jan

AU - Meggendorfer, Tobias

AU - Weininger, Maximilian

PY - 2023/7/14

Y1 - 2023/7/14

N2 - A classic solution technique for Markov decision processes (MDP) and stochastic games (SG) is value iteration (VI). Due to its good practical performance, this approximative approach is typically preferred over exact techniques, even though no practical bounds on the imprecision of the result could be given until recently. As a consequence, even the most used model checkers could return arbitrarily wrong results. Over the past decade, different works derived stopping criteria, indicating when the precision reaches the desired level, for various settings, in particular MDP with reachability, total reward, and mean payoff, and SG with reachability.In this paper, we provide the first stopping criteria for VI on SG with total reward and mean payoff, yielding the first anytime algorithms in these settings. To this end, we provide the solution in two flavours: First through a reduction to the MDP case and second directly on SG. The former is simpler and automatically utilizes any advances on MDP. The latter allows for more local computations, heading towards better practical efficiency.Our solution unifies the previously mentioned approaches for MDP and SG and their underlying ideas. To achieve this, we isolate objective-specific subroutines as well as identify objective-independent concepts. These structural concepts, while surprisingly simple, form the very essence of the unified solution.

AB - A classic solution technique for Markov decision processes (MDP) and stochastic games (SG) is value iteration (VI). Due to its good practical performance, this approximative approach is typically preferred over exact techniques, even though no practical bounds on the imprecision of the result could be given until recently. As a consequence, even the most used model checkers could return arbitrarily wrong results. Over the past decade, different works derived stopping criteria, indicating when the precision reaches the desired level, for various settings, in particular MDP with reachability, total reward, and mean payoff, and SG with reachability.In this paper, we provide the first stopping criteria for VI on SG with total reward and mean payoff, yielding the first anytime algorithms in these settings. To this end, we provide the solution in two flavours: First through a reduction to the MDP case and second directly on SG. The former is simpler and automatically utilizes any advances on MDP. The latter allows for more local computations, heading towards better practical efficiency.Our solution unifies the previously mentioned approaches for MDP and SG and their underlying ideas. To achieve this, we isolate objective-specific subroutines as well as identify objective-independent concepts. These structural concepts, while surprisingly simple, form the very essence of the unified solution.

KW - Stochastic games

KW - value iteration

U2 - 10.1109/LICS56636.2023.10175771

DO - 10.1109/LICS56636.2023.10175771

M3 - Conference contribution/Paper

T3 - Proceedings - Symposium on Logic in Computer Science

SP - 1

EP - 14

BT - 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2023

PB - IEEE

ER -