MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives.

LANCASTER UNIVERSITY LEIPZIG

Text available via DOI:

https://doi.org/10.1007/978-3-031-37709-9_5
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

Markov decision processes, Skolem hardness, distribution transformers, invariant synthesis

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. / Akshay, S.; Chatterjee, Krishnendu; Meggendorfer, Tobias et al.
Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings. ed. / Constantin Enea; Akash Lal. 2023. p. 86-112 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13966 LNCS).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Akshay, S, Chatterjee, K, Meggendorfer, T & Žikelić, Đ 2023, MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. in C Enea & A Lal (eds), Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13966 LNCS, pp. 86-112. https://doi.org/10.1007/978-3-031-37709-9_5

APA

Akshay, S., Chatterjee, K., Meggendorfer, T., & Žikelić, Đ. (2023). MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. In C. Enea, & A. Lal (Eds.), Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings (pp. 86-112). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13966 LNCS). https://doi.org/10.1007/978-3-031-37709-9_5

Vancouver

Akshay S, Chatterjee K, Meggendorfer T, Žikelić Đ. MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. In Enea C, Lal A, editors, Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings. 2023. p. 86-112. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-37709-9_5

Author

Akshay, S. ; Chatterjee, Krishnendu ; Meggendorfer, Tobias et al. / MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings. editor / Constantin Enea ; Akash Lal. 2023. pp. 86-112 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Bibtex

@inproceedings{0e18000df23b496f844e1123e1fc9538,

title = "MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives.",

abstract = "Markov decision processes can be viewed as transformers of probability distributions. While this view is useful from a practical standpoint to reason about trajectories of distributions, basic reachability and safety problems are known to be computationally intractable (i.e., Skolem-hard) to solve in such models. Further, we show that even for simple examples of MDPs, strategies for safety objectives over distributions can require infinite memory and randomization. In light of this, we present a novel overapproximation approach to synthesize strategies in an MDP, such that a safety objective over the distributions is met. More precisely, we develop a new framework for template-based synthesis of certificates as affine distributional and inductive invariants for safety objectives in MDPs. We provide two algorithms within this framework. One can only synthesize memoryless strategies, but has relative completeness guarantees, while the other can synthesize general strategies. The runtime complexity of both algorithms is in PSPACE. We implement these algorithms and show that they can solve several non-trivial examples.",

keywords = "Markov decision processes, Skolem hardness, distribution transformers, invariant synthesis",

author = "S. Akshay and Krishnendu Chatterjee and Tobias Meggendorfer and {\D}or{\d}e {\v Z}ikeli{\'c}",

note = "DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.",

year = "2023",

month = jul,

day = "17",

doi = "10.1007/978-3-031-37709-9_5",

language = "English",

isbn = "9783031377082",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "86--112",

editor = "Constantin Enea and Akash Lal",

booktitle = "Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings",

}

RIS

TY - GEN

T1 - MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives.

AU - Akshay, S.

AU - Chatterjee, Krishnendu

AU - Meggendorfer, Tobias

AU - Žikelić, Đorđe

N1 - DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.

PY - 2023/7/17

Y1 - 2023/7/17

N2 - Markov decision processes can be viewed as transformers of probability distributions. While this view is useful from a practical standpoint to reason about trajectories of distributions, basic reachability and safety problems are known to be computationally intractable (i.e., Skolem-hard) to solve in such models. Further, we show that even for simple examples of MDPs, strategies for safety objectives over distributions can require infinite memory and randomization. In light of this, we present a novel overapproximation approach to synthesize strategies in an MDP, such that a safety objective over the distributions is met. More precisely, we develop a new framework for template-based synthesis of certificates as affine distributional and inductive invariants for safety objectives in MDPs. We provide two algorithms within this framework. One can only synthesize memoryless strategies, but has relative completeness guarantees, while the other can synthesize general strategies. The runtime complexity of both algorithms is in PSPACE. We implement these algorithms and show that they can solve several non-trivial examples.

AB - Markov decision processes can be viewed as transformers of probability distributions. While this view is useful from a practical standpoint to reason about trajectories of distributions, basic reachability and safety problems are known to be computationally intractable (i.e., Skolem-hard) to solve in such models. Further, we show that even for simple examples of MDPs, strategies for safety objectives over distributions can require infinite memory and randomization. In light of this, we present a novel overapproximation approach to synthesize strategies in an MDP, such that a safety objective over the distributions is met. More precisely, we develop a new framework for template-based synthesis of certificates as affine distributional and inductive invariants for safety objectives in MDPs. We provide two algorithms within this framework. One can only synthesize memoryless strategies, but has relative completeness guarantees, while the other can synthesize general strategies. The runtime complexity of both algorithms is in PSPACE. We implement these algorithms and show that they can solve several non-trivial examples.

KW - Markov decision processes

KW - Skolem hardness

KW - distribution transformers

KW - invariant synthesis

U2 - 10.1007/978-3-031-37709-9_5

DO - 10.1007/978-3-031-37709-9_5

M3 - Conference contribution/Paper

SN - 9783031377082

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 86

EP - 112

BT - Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings

A2 - Enea, Constantin

A2 - Lal, Akash

ER -

Research

Links

Text available via DOI:

Keywords