Towards Large Scale Ad-hoc Teamwork

Associated organisational units

Electronic data

ICA2018
Accepted author manuscript, 585 KB, PDF document

Text available via DOI:

https://doi.org/10.1109/AGENTS.2018.8460136
Final published version

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

Towards Large Scale Ad-hoc Teamwork. / Shafipour Yourdshahi, Elnaz ; Pinder, Thomas; Dhawan, Gauri et al.
2018 IEEE International Conference on Agents (ICA). IEEE, 2018.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Shafipour Yourdshahi, E , Pinder, T, Dhawan, G, Soriano Marcolino, L & Angelov, PP 2018, Towards Large Scale Ad-hoc Teamwork. in 2018 IEEE International Conference on Agents (ICA). IEEE. https://doi.org/10.1109/AGENTS.2018.8460136

APA

Shafipour Yourdshahi, E., Pinder, T., Dhawan, G., Soriano Marcolino, L., & Angelov, P. P. (2018). Towards Large Scale Ad-hoc Teamwork. In 2018 IEEE International Conference on Agents (ICA) IEEE. https://doi.org/10.1109/AGENTS.2018.8460136

Vancouver

Shafipour Yourdshahi E , Pinder T, Dhawan G, Soriano Marcolino L , Angelov PP. Towards Large Scale Ad-hoc Teamwork. In 2018 IEEE International Conference on Agents (ICA). IEEE. 2018 doi: 10.1109/AGENTS.2018.8460136

Author

Shafipour Yourdshahi, Elnaz ; Pinder, Thomas ; Dhawan, Gauri et al. / Towards Large Scale Ad-hoc Teamwork. 2018 IEEE International Conference on Agents (ICA). IEEE, 2018.

Bibtex

@inproceedings{d10733a1542648baba761f97d0489e2d,

title = "Towards Large Scale Ad-hoc Teamwork",

abstract = "In complex environments, agents must be able to cooperate with previously unknown team-mates, and hence dynamically learn about other agents in the environment while searching for optimal actions. Previous works employ Monte Carlo Tree Search approaches. However, the search tree increases exponentially with the number of agents, and only scenarios with very small team sizes have been explored. Hence, in this paper we propose a history-based version of UCT Monte Carlo Tree Search, using a more compact representation than the original algorithm. We perform several experiments with a varying number of agents in the level-based foraging domain, an important testbed for ad-hoc teamwork. We achieve better overall performance than the state-of-the-art and better scalability with team size. Additionally, we contribute an open-source version of our system, making it easier for the research community to use the level-based foraging domain as a benchmark problern for ad-hoc teamwork.",

author = "{Shafipour Yourdshahi}, Elnaz and Thomas Pinder and Gauri Dhawan and {Soriano Marcolino}, Leandro and Angelov, {Plamen Parvanov}",

year = "2018",

month = sep,

day = "13",

doi = "10.1109/AGENTS.2018.8460136",

language = "English",

booktitle = "2018 IEEE International Conference on Agents (ICA)",

publisher = "IEEE",

}

RIS

TY - GEN

T1 - Towards Large Scale Ad-hoc Teamwork

AU - Shafipour Yourdshahi, Elnaz

AU - Pinder, Thomas

AU - Dhawan, Gauri

AU - Soriano Marcolino, Leandro

AU - Angelov, Plamen Parvanov

PY - 2018/9/13

Y1 - 2018/9/13

N2 - In complex environments, agents must be able to cooperate with previously unknown team-mates, and hence dynamically learn about other agents in the environment while searching for optimal actions. Previous works employ Monte Carlo Tree Search approaches. However, the search tree increases exponentially with the number of agents, and only scenarios with very small team sizes have been explored. Hence, in this paper we propose a history-based version of UCT Monte Carlo Tree Search, using a more compact representation than the original algorithm. We perform several experiments with a varying number of agents in the level-based foraging domain, an important testbed for ad-hoc teamwork. We achieve better overall performance than the state-of-the-art and better scalability with team size. Additionally, we contribute an open-source version of our system, making it easier for the research community to use the level-based foraging domain as a benchmark problern for ad-hoc teamwork.

AB - In complex environments, agents must be able to cooperate with previously unknown team-mates, and hence dynamically learn about other agents in the environment while searching for optimal actions. Previous works employ Monte Carlo Tree Search approaches. However, the search tree increases exponentially with the number of agents, and only scenarios with very small team sizes have been explored. Hence, in this paper we propose a history-based version of UCT Monte Carlo Tree Search, using a more compact representation than the original algorithm. We perform several experiments with a varying number of agents in the level-based foraging domain, an important testbed for ad-hoc teamwork. We achieve better overall performance than the state-of-the-art and better scalability with team size. Additionally, we contribute an open-source version of our system, making it easier for the research community to use the level-based foraging domain as a benchmark problern for ad-hoc teamwork.

U2 - 10.1109/AGENTS.2018.8460136

DO - 10.1109/AGENTS.2018.8460136

M3 - Conference contribution/Paper

BT - 2018 IEEE International Conference on Agents (ICA)

PB - IEEE

ER -

Research

Associated organisational units

Electronic data

Links

Text available via DOI: