Internet publication icon

Internet publication

Conditioning diffusions using Malliavin calculus

Abstract:
In stochastic optimal control and conditional generative modelling, a central computational task is to modify a reference diffusion process to maximise a given terminal-time reward. Most existing methods require this reward to be differentiable, using gradients to steer the diffusion towards favourable outcomes. However, in many practical settings, like diffusion bridges, the reward is singular, taking an infinite value if the target is hit and zero otherwise. We introduce a novel framework, based on Malliavin calculus and path-space integration by parts, that enables the development of methods robust to such singular rewards. This allows our approach to handle a broad range of applications, including classification, diffusion bridges, and conditioning without the need for artificial observational noise. We demonstrate that our approach offers stable and reliable training, outperforming existing techniques.
Publication status:
Published
Peer review status:
Not peer reviewed

Actions


Access Document


Publisher copy:
10.48550/arXiv.2504.03461

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-0821-4607


Host title:
arXiv
Publication date:
2025-04-04
DOI:


Language:
English
Pubs id:
2117808
Local pid:
pubs:2117808
Deposit date:
2025-05-12

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP