Home > Research > Publications & Outputs > Adaptive forgetting factor fictitious play

Electronic data

  • pdf

    504 KB, PDF document

Keywords

View graph of relations

Adaptive forgetting factor fictitious play

Research output: Contribution to Journal/MagazineJournal article

Unpublished

Standard

Adaptive forgetting factor fictitious play. / Smyrnakis, Michalis; S. Leslie, David.
In: arxiv.org, 11.12.2011.

Research output: Contribution to Journal/MagazineJournal article

Harvard

APA

Vancouver

Smyrnakis M, S. Leslie D. Adaptive forgetting factor fictitious play. arxiv.org. 2011 Dec 11.

Author

Smyrnakis, Michalis ; S. Leslie, David. / Adaptive forgetting factor fictitious play. In: arxiv.org. 2011.

Bibtex

@article{b8179eb135364427b909289243a58d17,
title = "Adaptive forgetting factor fictitious play",
abstract = "It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.",
keywords = "stat.ML, cs.LG, cs.MA",
author = "Michalis Smyrnakis and {S. Leslie}, David",
year = "2011",
month = dec,
day = "11",
language = "English",
journal = "arxiv.org",

}

RIS

TY - JOUR

T1 - Adaptive forgetting factor fictitious play

AU - Smyrnakis, Michalis

AU - S. Leslie, David

PY - 2011/12/11

Y1 - 2011/12/11

N2 - It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.

AB - It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.

KW - stat.ML

KW - cs.LG

KW - cs.MA

M3 - Journal article

JO - arxiv.org

JF - arxiv.org

ER -