Conference item
Rao-Blackwellised reparameterisation gradients
- Abstract:
- Latent Gaussian variables have been popularised in probabilistic machine learning. In turn, gradient estimators are the machinery that facilitates gradient-based optimisation for models with latent Gaussian variables. The reparameterisation trick is often used as the default estimator as it is simple to implement and yields low-variance gradients for variational inference. In this work, we propose the R2-G2 estimator as the Rao-Blackwellisation of the reparameterisation gradient estimator. Interestingly, we show that the local reparameterisation gradient estimator for Bayesian MLPs is an instance of the R2-G2 estimator and Rao-Blackwellisation. This lets us extend benefits of Rao-Blackwellised gradients to a suite of probabilistic models. We show that initial training with R2-G2 consistently yields better performance in models with multiple applications of the reparameterisation trick.
- Publication status:
- Accepted
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 561.8KB, Terms of use)
-
- Publication website:
- https://neurips.cc/virtual/2025/loc/san-diego/poster/118582
Authors
- Publisher:
- Neural Information Processing Systems Foundation
- Publication date:
- 2025-06-09
- Acceptance date:
- 2025-09-18
- Event title:
- Thirty-Ninth Annual Conference on Neural Information Processing Systems
- Event location:
- San Diego, California, USA and Mexico City, Mexico
- Event website:
- https://neurips.cc/
- Event start date:
- 2025-11-30
- Event end date:
- 2025-12-07
- Language:
-
English
- Pubs id:
-
2129958
- Local pid:
-
pubs:2129958
- Deposit date:
-
2025-09-19
- ARK identifier:
Terms of use
- Copyright holder:
- Lam et al
- Copyright date:
- 2025
- Rights statement:
- ©2025 The Authors
- Notes:
-
This paper was presented at The Thirty-Ninth Annual Conference on Neural Information Processing Systems, 30/11-7/12/2025, San Diego, California, USA and Mexico City, Mexico
The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record