Accepted author manuscript, 1.82 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
}
TY - GEN
T1 - Information-guided Planning
T2 - Thirty-seventh Conference on Neural Information Processing Systems
AU - do Carmo Alves, Matheus Aparecido
AU - Varma, Amokh
AU - Soriano Marcolino, Leandro
AU - Elkhatib, Yehia
N1 - Conference code: 37
PY - 2023/9/21
Y1 - 2023/9/21
N2 - This paper presents IB-POMCP, a novel algorithm for online planning under partial observability. Our approach enhances the decision-making process by using estimations of the world belief's entropy to guide a tree search process and surpass the limitations of planning in scenarios with sparse reward configurations. By performing what we denominate as an information-guided planning process, the algorithm, which incorporates a novel I-UCB function, shows significant improvements in reward and reasoning time compared to state-of-the-art baselines in several benchmark scenarios, along with theoretical convergence guarantees.
AB - This paper presents IB-POMCP, a novel algorithm for online planning under partial observability. Our approach enhances the decision-making process by using estimations of the world belief's entropy to guide a tree search process and surpass the limitations of planning in scenarios with sparse reward configurations. By performing what we denominate as an information-guided planning process, the algorithm, which incorporates a novel I-UCB function, shows significant improvements in reward and reasoning time compared to state-of-the-art baselines in several benchmark scenarios, along with theoretical convergence guarantees.
KW - Information-guided planning
KW - Planning under uncertainty
KW - Sequential decision making
UR - https://github.com/lsmcolab/ib-pomcp/
M3 - Conference contribution/Paper
BT - Thirty-seventh Conference on Neural Information Processing Systems
Y2 - 10 December 2023 through 16 December 2023
ER -