Adaptive forgetting factor fictitious play

Mathematics and Statistics

Associated organisational units

Electronic data

pdf
504 KB, PDF document

Keywords

stat.ML, cs.LG, cs.MA

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article

Unpublished

Standard

Adaptive forgetting factor fictitious play. / Smyrnakis, Michalis; S. Leslie, David.
In: arxiv.org, 11.12.2011.

Research output: Contribution to Journal/Magazine › Journal article

Bibtex

@article{b8179eb135364427b909289243a58d17,

title = "Adaptive forgetting factor fictitious play",

abstract = "It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.",

keywords = "stat.ML, cs.LG, cs.MA",

author = "Michalis Smyrnakis and {S. Leslie}, David",

year = "2011",

month = dec,

day = "11",

language = "English",

journal = "arxiv.org",

}

RIS

TY - JOUR

T1 - Adaptive forgetting factor fictitious play

AU - Smyrnakis, Michalis

AU - S. Leslie, David

PY - 2011/12/11

Y1 - 2011/12/11

N2 - It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.

AB - It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents' strategies are stationary. We present a novel variation of fictitious play that allows the use of a more realistic model of opponent strategy. It uses a heuristic approach, from the online streaming data literature, to adaptively update the weights assigned to recently observed actions. We compare the results of the proposed algorithm with those of stochastic and geometric fictitious play in a simple strategic form game, a vehicle target assignment game and a disaster management problem. In all the tests the rate of convergence of the proposed algorithm was similar or better than the variations of fictitious play we compared it with. The new algorithm therefore improves the performance of game-theoretical learning in decentralised optimisation.

KW - stat.ML

KW - cs.LG

KW - cs.MA

M3 - Journal article

JO - arxiv.org

JF - arxiv.org

ER -

Research

Associated organisational units

Electronic data

Keywords