Home > Research > Publications & Outputs > Causally Deterministic Markov Decision Processes.

Links

Text available via DOI:

View graph of relations

Causally Deterministic Markov Decision Processes.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Causally Deterministic Markov Decision Processes. / Akshay, S.; Meggendorfer, Tobias; Thiagarajan, P. S.
35th International Conference on Concurrency Theory, CONCUR 2024. ed. / Rupak Majumdar; Alexandra Silva. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2024. p. 6:1-6:22 6 (Leibniz International Proceedings in Informatics, LIPIcs; Vol. 311).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Akshay, S, Meggendorfer, T & Thiagarajan, PS 2024, Causally Deterministic Markov Decision Processes. in R Majumdar & A Silva (eds), 35th International Conference on Concurrency Theory, CONCUR 2024., 6, Leibniz International Proceedings in Informatics, LIPIcs, vol. 311, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, pp. 6:1-6:22. https://doi.org/10.4230/LIPIcs.CONCUR.2024.6

APA

Akshay, S., Meggendorfer, T., & Thiagarajan, P. S. (2024). Causally Deterministic Markov Decision Processes. In R. Majumdar, & A. Silva (Eds.), 35th International Conference on Concurrency Theory, CONCUR 2024 (pp. 6:1-6:22). Article 6 (Leibniz International Proceedings in Informatics, LIPIcs; Vol. 311). Schloss Dagstuhl - Leibniz-Zentrum für Informatik. https://doi.org/10.4230/LIPIcs.CONCUR.2024.6

Vancouver

Akshay S, Meggendorfer T, Thiagarajan PS. Causally Deterministic Markov Decision Processes. In Majumdar R, Silva A, editors, 35th International Conference on Concurrency Theory, CONCUR 2024. Schloss Dagstuhl - Leibniz-Zentrum für Informatik. 2024. p. 6:1-6:22. 6. (Leibniz International Proceedings in Informatics, LIPIcs). Epub 2024 Aug 29. doi: 10.4230/LIPIcs.CONCUR.2024.6

Author

Akshay, S. ; Meggendorfer, Tobias ; Thiagarajan, P. S. / Causally Deterministic Markov Decision Processes. 35th International Conference on Concurrency Theory, CONCUR 2024. editor / Rupak Majumdar ; Alexandra Silva. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2024. pp. 6:1-6:22 (Leibniz International Proceedings in Informatics, LIPIcs).

Bibtex

@inproceedings{2922387baa4947f2b7702917efedabbc,
title = "Causally Deterministic Markov Decision Processes.",
abstract = "Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.",
author = "S. Akshay and Tobias Meggendorfer and Thiagarajan, {P. S.}",
note = "DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.",
year = "2024",
month = sep,
day = "9",
doi = "10.4230/LIPIcs.CONCUR.2024.6",
language = "English",
series = "Leibniz International Proceedings in Informatics, LIPIcs",
publisher = "Schloss Dagstuhl - Leibniz-Zentrum f{\"u}r Informatik",
pages = "6:1--6:22",
editor = "Rupak Majumdar and Alexandra Silva",
booktitle = "35th International Conference on Concurrency Theory, CONCUR 2024",

}

RIS

TY - GEN

T1 - Causally Deterministic Markov Decision Processes.

AU - Akshay, S.

AU - Meggendorfer, Tobias

AU - Thiagarajan, P. S.

N1 - DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.

PY - 2024/9/9

Y1 - 2024/9/9

N2 - Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.

AB - Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.

U2 - 10.4230/LIPIcs.CONCUR.2024.6

DO - 10.4230/LIPIcs.CONCUR.2024.6

M3 - Conference contribution/Paper

T3 - Leibniz International Proceedings in Informatics, LIPIcs

SP - 6:1-6:22

BT - 35th International Conference on Concurrency Theory, CONCUR 2024

A2 - Majumdar, Rupak

A2 - Silva, Alexandra

PB - Schloss Dagstuhl - Leibniz-Zentrum für Informatik

ER -