Home > Research > Publications & Outputs > Asynchronous stochastic approximation with diff...

Electronic data

  • 409-446

    Submitted manuscript, 308 KB, PDF document

    Available under license: CC BY

Links

Text available via DOI:

View graph of relations

Asynchronous stochastic approximation with differential inclusions

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Asynchronous stochastic approximation with differential inclusions. / Perkins, Steven ; Leslie, David S.
In: Stochastic Systems, Vol. 2, 2012, p. 409-446.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Perkins S, Leslie DS. Asynchronous stochastic approximation with differential inclusions. Stochastic Systems. 2012;2:409-446. doi: 10.1214/11-SSY056

Author

Perkins, Steven ; Leslie, David S. / Asynchronous stochastic approximation with differential inclusions. In: Stochastic Systems. 2012 ; Vol. 2. pp. 409-446.

Bibtex

@article{137918a32d55447e8477f0d8cc87ed95,
title = "Asynchronous stochastic approximation with differential inclusions",
abstract = "The asymptotic pseudo-trajectory approach to stochastic approximation of Bena{\"i}m, Hofbauer and Sorin is extended for asynchronous stochastic approximations with a set-valued mean field. The asynchronicity of the process is incorporated into the mean field to produce convergence results which remain similar to those of an equivalent synchronous process. In addition, this allows many of the restrictive assumptions previously associated with asynchronous stochastic approximation to be removed. The framework is extended for a coupled asynchronous stochastic approximation process with set-valued mean fields. Two-timescales arguments are used here in a similar manner to the original work in this area by Borkar. The applicability of this approach is demonstrated through learning in a Markov decision process. ",
author = "Steven Perkins and Leslie, {David S.}",
year = "2012",
doi = "10.1214/11-SSY056",
language = "English",
volume = "2",
pages = "409--446",
journal = "Stochastic Systems",
issn = "1946-5238",
publisher = "INFORMS Institute for Operations Research and the Management Sciences",

}

RIS

TY - JOUR

T1 - Asynchronous stochastic approximation with differential inclusions

AU - Perkins, Steven

AU - Leslie, David S.

PY - 2012

Y1 - 2012

N2 - The asymptotic pseudo-trajectory approach to stochastic approximation of Benaïm, Hofbauer and Sorin is extended for asynchronous stochastic approximations with a set-valued mean field. The asynchronicity of the process is incorporated into the mean field to produce convergence results which remain similar to those of an equivalent synchronous process. In addition, this allows many of the restrictive assumptions previously associated with asynchronous stochastic approximation to be removed. The framework is extended for a coupled asynchronous stochastic approximation process with set-valued mean fields. Two-timescales arguments are used here in a similar manner to the original work in this area by Borkar. The applicability of this approach is demonstrated through learning in a Markov decision process.

AB - The asymptotic pseudo-trajectory approach to stochastic approximation of Benaïm, Hofbauer and Sorin is extended for asynchronous stochastic approximations with a set-valued mean field. The asynchronicity of the process is incorporated into the mean field to produce convergence results which remain similar to those of an equivalent synchronous process. In addition, this allows many of the restrictive assumptions previously associated with asynchronous stochastic approximation to be removed. The framework is extended for a coupled asynchronous stochastic approximation process with set-valued mean fields. Two-timescales arguments are used here in a similar manner to the original work in this area by Borkar. The applicability of this approach is demonstrated through learning in a Markov decision process.

U2 - 10.1214/11-SSY056

DO - 10.1214/11-SSY056

M3 - Journal article

VL - 2

SP - 409

EP - 446

JO - Stochastic Systems

JF - Stochastic Systems

SN - 1946-5238

ER -