Accepted author manuscript, 830 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License
Final published version
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
}
TY - GEN
T1 - Algorithms or Actions?
T2 - 27th International Joint Conference on Artificial Intelligence, IJCAI 2018
AU - Rocha Tavares, Anderson
AU - Anbalagan, Sivasubramanian
AU - Soriano Marcolino, Leandro
AU - Chaimowicz, Luiz
PY - 2018/7/1
Y1 - 2018/7/1
N2 - Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
AB - Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
KW - Machine Learning: Reinforcement Learning
KW - Multidisciplinary Topics and Applications: Computer Games
KW - Uncertainty in AI: Markov Decision Processes
KW - Machine Learning Applications: Game Playing
U2 - 10.24963/ijcai.2018/377
DO - 10.24963/ijcai.2018/377
M3 - Conference contribution/Paper
SN - 9780999241127
SP - 2717
EP - 2723
BT - Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18)
PB - International Joint Conferences on Artificial Intelligence
Y2 - 13 July 2018 through 19 July 2018
ER -