Large-Scale Stochastic Sampling from the Probability Simplex

Associated organisational units

Electronic data

1806.07137v1
Accepted author manuscript, 450 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Keywords

stat.CO, cs.LG, stat.ML

View graph of relations

Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review

Published

More...

Publication date	3/12/2018
Number of pages	11
Pages	6722-6732
<mark>Original language</mark>	English
Event	32nd Neural Information Processing Systems Conference (NIPS 2018) - Palais des Congrès de Montréal, Montreal, Canada Duration: 3/12/2018 → 8/12/2018 https://nips.cc/

Conference

Conference	32nd Neural Information Processing Systems Conference (NIPS 2018)
Country/Territory	Canada
City	Montreal
Period	3/12/18 → 8/12/18
Internet address	https://nips.cc/

Abstract

Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space, such as the simplex, the time-discretisation error can dominate when we are near the boundary of the space. We demonstrate that while current SGMCMC methods for the simplex perform well in certain cases, they struggle with sparse simplex spaces; when many of the components are close to zero. However, most popular large-scale applications of Bayesian inference on simplex spaces, such as network or topic models, are sparse. We argue that this poor performance is due to the biases of SGMCMC caused by the discretization error. To get around this, we propose the stochastic CIR process, which removes all discretization error and we prove that samples from the stochastic CIR process are asymptotically unbiased. Use of the stochastic CIR process within a SGMCMC algorithm is shown to give substantially better performance for a topic model and a Dirichlet process mixture model than existing SGMCMC approaches.

Research

Associated organisational units

Electronic data

Links

Keywords