Home > Research > Publications & Outputs > Constructing summary statistics for approximate...
View graph of relations

Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion)

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion). / Fearnhead, Paul; Prangle, Dennis.
In: Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 74, No. 3, 06.2012, p. 419-474.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Fearnhead, P & Prangle, D 2012, 'Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion)', Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 74, no. 3, pp. 419-474. https://doi.org/10.1111/j.1467-9868.2011.01010.x

APA

Vancouver

Fearnhead P, Prangle D. Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion). Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2012 Jun;74(3):419-474. Epub 2012 May 15. doi: 10.1111/j.1467-9868.2011.01010.x

Author

Fearnhead, Paul ; Prangle, Dennis. / Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion). In: Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2012 ; Vol. 74, No. 3. pp. 419-474.

Bibtex

@article{b5730ea3c8d44f44a9997250ac08fa38,
title = "Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion)",
abstract = "Many modern statistical applications involve inference for complex stochastic models, where it is easy to simulate from the models, but impossible to calculate likelihoods. Approximate Bayesian computation (ABC) is a method of inference for such models. It replaces calculation of the likelihood by a step which involves simulating artificial data for different parameter values, and comparing summary statistics of the simulated data with summary statistics of the observed data. Here we show how to construct appropriate summary statistics for ABC in a semi-automatic manner. We aim for summary statistics which will enable inference about certain parameters of interest to be as accurate as possible. Theoretical results show that optimal summary statistics are the posterior means of the parameters. Although these cannot be calculated analytically, we use an extra stage of simulation to estimate how the posterior means vary as a function of the data; and we then use these estimates of our summary statistics within ABC. Empirical results show that our approach is a robust method for choosing summary statistics that can result in substantially more accurate ABC analyses than the ad hoc choices of summary statistics that have been proposed in the literature. We also demonstrate advantages over two alternative methods of simulation-based inference.",
keywords = "Indirect inference , Likelihood-free inference , Markov chain Monte Carlo methods , Simulation , Stochastic kinetic networks",
author = "Paul Fearnhead and Dennis Prangle",
year = "2012",
month = jun,
doi = "10.1111/j.1467-9868.2011.01010.x",
language = "English",
volume = "74",
pages = "419--474",
journal = "Journal of the Royal Statistical Society: Series B (Statistical Methodology)",
issn = "1369-7412",
publisher = "Wiley-Blackwell",
number = "3",

}

RIS

TY - JOUR

T1 - Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation (with Discussion)

AU - Fearnhead, Paul

AU - Prangle, Dennis

PY - 2012/6

Y1 - 2012/6

N2 - Many modern statistical applications involve inference for complex stochastic models, where it is easy to simulate from the models, but impossible to calculate likelihoods. Approximate Bayesian computation (ABC) is a method of inference for such models. It replaces calculation of the likelihood by a step which involves simulating artificial data for different parameter values, and comparing summary statistics of the simulated data with summary statistics of the observed data. Here we show how to construct appropriate summary statistics for ABC in a semi-automatic manner. We aim for summary statistics which will enable inference about certain parameters of interest to be as accurate as possible. Theoretical results show that optimal summary statistics are the posterior means of the parameters. Although these cannot be calculated analytically, we use an extra stage of simulation to estimate how the posterior means vary as a function of the data; and we then use these estimates of our summary statistics within ABC. Empirical results show that our approach is a robust method for choosing summary statistics that can result in substantially more accurate ABC analyses than the ad hoc choices of summary statistics that have been proposed in the literature. We also demonstrate advantages over two alternative methods of simulation-based inference.

AB - Many modern statistical applications involve inference for complex stochastic models, where it is easy to simulate from the models, but impossible to calculate likelihoods. Approximate Bayesian computation (ABC) is a method of inference for such models. It replaces calculation of the likelihood by a step which involves simulating artificial data for different parameter values, and comparing summary statistics of the simulated data with summary statistics of the observed data. Here we show how to construct appropriate summary statistics for ABC in a semi-automatic manner. We aim for summary statistics which will enable inference about certain parameters of interest to be as accurate as possible. Theoretical results show that optimal summary statistics are the posterior means of the parameters. Although these cannot be calculated analytically, we use an extra stage of simulation to estimate how the posterior means vary as a function of the data; and we then use these estimates of our summary statistics within ABC. Empirical results show that our approach is a robust method for choosing summary statistics that can result in substantially more accurate ABC analyses than the ad hoc choices of summary statistics that have been proposed in the literature. We also demonstrate advantages over two alternative methods of simulation-based inference.

KW - Indirect inference

KW - Likelihood-free inference

KW - Markov chain Monte Carlo methods

KW - Simulation

KW - Stochastic kinetic networks

U2 - 10.1111/j.1467-9868.2011.01010.x

DO - 10.1111/j.1467-9868.2011.01010.x

M3 - Journal article

VL - 74

SP - 419

EP - 474

JO - Journal of the Royal Statistical Society: Series B (Statistical Methodology)

JF - Journal of the Royal Statistical Society: Series B (Statistical Methodology)

SN - 1369-7412

IS - 3

ER -