Home > Research > Publications & Outputs > SwISS

Electronic data

  • SwISS

    Accepted author manuscript, 892 KB, PDF document

    Available under license: CC BY: Creative Commons Attribution 4.0 International License

Links

Text available via DOI:

View graph of relations

SwISS: A Scalable Markov chain Monte Carlo Divide-and-Conquer Strategy

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

SwISS: A Scalable Markov chain Monte Carlo Divide-and-Conquer Strategy. / Vyner, Callum; Nemeth, Christopher; Sherlock, Chris.
In: Stat, Vol. 12, No. 1, e523, 31.12.2023.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Vyner C, Nemeth C, Sherlock C. SwISS: A Scalable Markov chain Monte Carlo Divide-and-Conquer Strategy. Stat. 2023 Dec 31;12(1):e523. Epub 2023 Jan 2. doi: 10.1002/sta4.523

Author

Bibtex

@article{9bf80df39a0948738740f818c17c38e1,
title = "SwISS: A Scalable Markov chain Monte Carlo Divide-and-Conquer Strategy",
abstract = "Divide-and-conquer strategies for Monte Carlo algorithms are an increasingly popular approach to making Bayesian inference scalable to large data sets. In its simplest form, the data are partitioned across multiple computing cores and a separate Markov chain Monte Carlo algorithm on each core targets the associated partial posterior distribution, which we refer to as a sub-posterior, that is the posterior given only the data from the segment of the partition associated with that core. Divide-and-conquer techniques reduce computational, memory and disk bottle necks, but make it difficult to recombine the sub-posterior samples. We propose SwISS: Sub-posteriors with Inflation, Scaling and Shifting; a new approach for recombining the sub-posterior samples which is simple to apply, scales to high-dimensional parameter spaces and accurately approximates the original posterior distribution through affine transformations of the sub-posterior samples. We prove that our transformation is asymptotically optimal across a natural set of affine transformations and illustrate the efficacy of SwISS against competing algorithms on synthetic and real-world data sets.",
keywords = "Markov chain Monte Carlo, divide-and-conquer, parallel MCMC, big data",
author = "Callum Vyner and Christopher Nemeth and Chris Sherlock",
year = "2023",
month = dec,
day = "31",
doi = "10.1002/sta4.523",
language = "English",
volume = "12",
journal = "Stat",
issn = "2049-1573",
publisher = "Wiley-Blackwell Publishing Ltd",
number = "1",

}

RIS

TY - JOUR

T1 - SwISS

T2 - A Scalable Markov chain Monte Carlo Divide-and-Conquer Strategy

AU - Vyner, Callum

AU - Nemeth, Christopher

AU - Sherlock, Chris

PY - 2023/12/31

Y1 - 2023/12/31

N2 - Divide-and-conquer strategies for Monte Carlo algorithms are an increasingly popular approach to making Bayesian inference scalable to large data sets. In its simplest form, the data are partitioned across multiple computing cores and a separate Markov chain Monte Carlo algorithm on each core targets the associated partial posterior distribution, which we refer to as a sub-posterior, that is the posterior given only the data from the segment of the partition associated with that core. Divide-and-conquer techniques reduce computational, memory and disk bottle necks, but make it difficult to recombine the sub-posterior samples. We propose SwISS: Sub-posteriors with Inflation, Scaling and Shifting; a new approach for recombining the sub-posterior samples which is simple to apply, scales to high-dimensional parameter spaces and accurately approximates the original posterior distribution through affine transformations of the sub-posterior samples. We prove that our transformation is asymptotically optimal across a natural set of affine transformations and illustrate the efficacy of SwISS against competing algorithms on synthetic and real-world data sets.

AB - Divide-and-conquer strategies for Monte Carlo algorithms are an increasingly popular approach to making Bayesian inference scalable to large data sets. In its simplest form, the data are partitioned across multiple computing cores and a separate Markov chain Monte Carlo algorithm on each core targets the associated partial posterior distribution, which we refer to as a sub-posterior, that is the posterior given only the data from the segment of the partition associated with that core. Divide-and-conquer techniques reduce computational, memory and disk bottle necks, but make it difficult to recombine the sub-posterior samples. We propose SwISS: Sub-posteriors with Inflation, Scaling and Shifting; a new approach for recombining the sub-posterior samples which is simple to apply, scales to high-dimensional parameter spaces and accurately approximates the original posterior distribution through affine transformations of the sub-posterior samples. We prove that our transformation is asymptotically optimal across a natural set of affine transformations and illustrate the efficacy of SwISS against competing algorithms on synthetic and real-world data sets.

KW - Markov chain Monte Carlo

KW - divide-and-conquer

KW - parallel MCMC

KW - big data

U2 - 10.1002/sta4.523

DO - 10.1002/sta4.523

M3 - Journal article

VL - 12

JO - Stat

JF - Stat

SN - 2049-1573

IS - 1

M1 - e523

ER -