Internet publication
Conditioning diffusions using Malliavin calculus
- Abstract:
- In stochastic optimal control and conditional generative modelling, a central computational task is to modify a reference diffusion process to maximise a given terminal-time reward. Most existing methods require this reward to be differentiable, using gradients to steer the diffusion towards favourable outcomes. However, in many practical settings, like diffusion bridges, the reward is singular, taking an infinite value if the target is hit and zero otherwise. We introduce a novel framework, based on Malliavin calculus and path-space integration by parts, that enables the development of methods robust to such singular rewards. This allows our approach to handle a broad range of applications, including classification, diffusion bridges, and conditioning without the need for artificial observational noise. We demonstrate that our approach offers stable and reliable training, outperforming existing techniques.
- Publication status:
- Published
- Peer review status:
- Not peer reviewed
Access Document
- Files:
- Pre-print, pdf, 6.8MB
- Publisher copy:
- 10.48550/arXiv.2504.03461
- Host title:
- arXiv
- Publication date:
- 2025-04-04
- DOI:
- Language:
- English
- Pubs id:
- 2117808
- Local pid:
- pubs:2117808
- Deposit date:
- 2025-05-12
Terms of use
- Copyright holder:
- Pidstrigach et al.
- Copyright date:
- 2025
- Rights statement:
- Copyright © 2025 The Author(s).