Efficient and Generalizable Tuning Strategies for Stochastic Gradient MCMC

School of Mathematical Sciences

Associated organisational unit

Statistical Artificial Intelligence

Electronic data

2105.13059v3
Submitted manuscript, 911 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

stat.CO, stat.ME, stat.ML

Research output: Working paper › Preprint

Published

Publication date	27/05/2021
Publisher	arXiv
Original language	English

Abstract

Stochastic gradient Markov chain Monte Carlo (SGMCMC) is a popular class of algorithms for scalable Bayesian inference. However, these algorithms include hyperparameters such as step size or batch size that influence the accuracy of estimators based on the obtained posterior samples. As a result, these hyperparameters must be tuned by the practitioner and currently no principled and automated way to tune them exists. Standard MCMC tuning methods based on acceptance rates cannot be used for SGMCMC, thus requiring alternative tools and diagnostics. We propose a novel bandit-based algorithm that tunes the SGMCMC hyperparameters by minimizing the Stein discrepancy between the true posterior and its Monte Carlo approximation. We provide theoretical results supporting this approach and assess various Stein-based discrepancies. We support our results with experiments on both simulated and real datasets, and find that this method is practical for a wide range of applications.
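
To make the idea in the abstract concrete, here is a minimal sketch of tuning an SGMCMC step size by minimizing a Stein discrepancy. It is an illustration under stated assumptions, not the paper's algorithm: the sampler is plain stochastic gradient Langevin dynamics on a toy Gaussian-mean model, the discrepancy is a kernel Stein discrepancy with an inverse multiquadric kernel computed from full-data gradients, and a generic successive-halving loop stands in for the bandit component. All function names and parameter values (imq_ksd, sgld, tune_step_size, the step-size grid, the batch size) are hypothetical.

```python
import numpy as np

def imq_ksd(x, s, c=1.0, beta=-0.5):
    """Kernel Stein discrepancy (V-statistic) with the IMQ kernel
    k(a, b) = (c^2 + ||a - b||^2)^beta.
    x: (n, d) samples; s: (n, d) gradients of the log posterior at x."""
    n, d = x.shape
    u = x[:, None, :] - x[None, :, :]            # u[i, j] = x_i - x_j
    r2 = (u ** 2).sum(axis=-1)
    base = c ** 2 + r2
    dk = 2 * beta * base ** (beta - 1)           # grad_a k = dk * u, grad_b k = -dk * u
    si_u = np.einsum("id,ijd->ij", s, u)         # s(x_i) . (x_i - x_j)
    sj_u = np.einsum("jd,ijd->ij", s, u)         # s(x_j) . (x_i - x_j)
    trace = (-2 * beta * d * base ** (beta - 1)
             - 4 * beta * (beta - 1) * r2 * base ** (beta - 2))
    stein_kernel = (s @ s.T) * base ** beta + dk * (sj_u - si_u) + trace
    return float(np.sqrt(max(stein_kernel.mean(), 0.0)))

def log_post_grad(theta, y, prior_var=100.0):
    """Full-data gradient of the log posterior for the assumed toy model
    y_i ~ N(theta, 1), theta ~ N(0, prior_var). theta has shape (n, 1)."""
    return -theta / prior_var + (y.sum() - len(y) * theta)

def sgld(theta, h, n_steps, y, rng, batch_size=32, prior_var=100.0):
    """Run n_steps of SGLD from scalar theta with step size h; returns (n_steps, 1)."""
    N = len(y)
    out = np.empty((n_steps, 1))
    for t in range(n_steps):
        batch = rng.choice(N, size=batch_size, replace=False)
        grad = -theta / prior_var + (N / batch_size) * np.sum(y[batch] - theta)
        theta = theta + 0.5 * h * grad + np.sqrt(h) * rng.normal()
        out[t, 0] = theta
    return out

def tune_step_size(step_sizes, y, steps_per_round=200, seed=0):
    """Successive halving: extend every surviving chain, score it by KSD,
    keep the better half, and repeat until one step size remains."""
    rng = np.random.default_rng(seed)
    chains = {h: np.zeros((0, 1)) for h in step_sizes}
    while True:
        scores = {}
        for h in list(chains):
            start = chains[h][-1, 0] if len(chains[h]) else 0.0
            chains[h] = np.vstack([chains[h], sgld(start, h, steps_per_round, y, rng)])
            scores[h] = imq_ksd(chains[h], log_post_grad(chains[h], y))
        ranked = sorted(scores, key=scores.get)  # smallest KSD first
        if len(ranked) == 1:
            return ranked[0], scores[ranked[0]]
        chains = {h: chains[h] for h in ranked[: len(ranked) // 2]}

# Usage on simulated data (values chosen only for illustration).
data_rng = np.random.default_rng(1)
y = data_rng.normal(1.0, 1.0, size=1000)
grid = np.logspace(-6, -2.5, 8).tolist()
best_h, best_ksd = tune_step_size(grid, y)
print(best_h, best_ksd)
```

In this toy setting very small step sizes leave the chain stuck near its starting point, while very large ones inflate the sample variance through gradient noise, so both ends of the grid tend to receive a large discrepancy. Note that no acceptance rate is involved anywhere: the score only needs the samples and gradients of the log posterior.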