Reinforcement learning under uncertainty - Research Portal

Home > Research > Publications & Outputs > Reinforcement learning under uncertainty

School Of Mathematical Sciences

Electronic data

RL_under_uncertainty_Manuscript (Ez-zizi et al., 2022)
Accepted author manuscript, 5.25 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.1007/s42113-022-00165-y
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

Bayesian reinforcement learning, Expected and unexpected uncertainty, Reinforcement learning, Sampling-based learning

View graph of relations

Reinforcement learning under uncertainty: expected versus unexpected uncertainty and state versus reward uncertainty

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Adnane Ez-Zizi
Simon Farrell
David Leslie
Gaurav Malhotra
Casimir J. H. Ludwig

More...

<mark>Journal publication date</mark>	1/12/2023
<mark>Journal</mark>	Computational Brain and Behavior
Issue number	4
Volume	6
Number of pages	25
Pages (from-to)	626-650
Publication Status	Published
Early online date	20/03/23
<mark>Original language</mark>	English

Abstract

Two prominent types of uncertainty that have been studied extensively are expected and unexpected uncertainty. Studies suggest that humans are capable of learning from reward under both expected and unexpected uncertainty when the source of variability is the reward. How do people learn when the source of uncertainty is the environment’s state and the rewards themselves are deterministic? How does their learning compare with the case of reward uncertainty? The present study addressed these questions using behavioural experimentation and computational modelling. Experiment 1 showed that human subjects were generally able to use reward feedback to successfully learn the task rules under state uncertainty, and were able to detect a non-signalled reversal of stimulus-response contingencies. Experiment 2, which combined all four types of uncertainties—expected versus unexpected uncertainty, and state versus reward uncertainty—highlighted key similarities and differences in learning between state and reward uncertainties. We found that subjects performed significantly better in the state uncertainty condition, primarily because they explored less and improved their state disambiguation. We also show that a simple reinforcement learning mechanism that ignores state uncertainty and updates the state-action value of only the identified state accounted for the behavioural data better than both a Bayesian reinforcement learning model that keeps track of belief states and a model that acts based on sampling from past experiences. Our findings suggest a common mechanism supports reward-based learning under state and reward uncertainty.

Research

Electronic data

Links

Text available via DOI:

Keywords

Reinforcement learning under uncertainty: expected versus unexpected uncertainty and state versus reward uncertainty

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us