Reinforcement Learning for Adaptive MCMC

Wang, Congye; Chen, Wilson; Kanagawa, Heishiro; Oates, Chris. J.

Statistics > Computation

arXiv:2405.13574 (stat)

[Submitted on 22 May 2024]

Title:Reinforcement Learning for Adaptive MCMC

Authors:Congye Wang, Wilson Chen, Heishiro Kanagawa, Chris. J. Oates

View PDF HTML (experimental)

Abstract:An informal observation, made by several authors, is that the adaptive design of a Markov transition kernel has the flavour of a reinforcement learning task. Yet, to-date it has remained unclear how to actually exploit modern reinforcement learning technologies for adaptive MCMC. The aim of this paper is to set out a general framework, called Reinforcement Learning Metropolis--Hastings, that is theoretically supported and empirically validated. Our principal focus is on learning fast-mixing Metropolis--Hastings transition kernels, which we cast as deterministic policies and optimise via a policy gradient. Control of the learning rate provably ensures conditions for ergodicity are satisfied. The methodology is used to construct a gradient-free sampler that out-performs a popular gradient-free adaptive Metropolis--Hastings algorithm on $\approx 90 \%$ of tasks in the PosteriorDB benchmark.

Subjects:	Computation (stat.CO); Machine Learning (cs.LG)
Cite as:	arXiv:2405.13574 [stat.CO]
	(or arXiv:2405.13574v1 [stat.CO] for this version)
	https://doi.org/10.48550/arXiv.2405.13574

Submission history

From: Chris Oates [view email]
[v1] Wed, 22 May 2024 12:11:12 UTC (24,202 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.CO

< prev | next >

new | recent | 2024-05

Change to browse by:

cs
cs.LG
stat

References & Citations

export BibTeX citation

Statistics > Computation

Title:Reinforcement Learning for Adaptive MCMC

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Computation

Title:Reinforcement Learning for Adaptive MCMC

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators