AI Collection

Conference item

Probabilistic performance guarantees for multi-task reinforcement learning

Abstract:: Multi-task reinforcement learning trains generalist policies that can execute multiple tasks. While recent years have seen significant progress, existing approaches rarely provide formal performance guarantees, which are indispensable when deploying policies in safety-critical settings. We present an approach for computing high-confidence guarantees on the performance of a multi-task policy on tasks not seen during training. Concretely, we introduce a new generalisation bound that composes (i) per-task lower confidence bounds from finitely many rollouts with (ii) task-level generalisation from finitely many sampled tasks, yielding a high-confidence guarantee for new tasks drawn from the same arbitrary and unknown distribution. Across state-of-the-art multi-task RL methods, we show that the guarantees are theoretically sound and informative at realistic sample sizes.

Publication status:: Accepted

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Schnitzer, Y., Jackermeier, M., Abate, A., & Parker, D. A. (2026). Probabilistic performance guarantees for multi-task reinforcement learning. 43rd International Conference on Machine Learning (ICML'26).

MLA Style

Schnitzer, Y, et al. “Probabilistic Performance Guarantees for Multi-Task Reinforcement Learning.” 43rd International Conference on Machine Learning (ICML'26), 2026.

Chicago Style

Schnitzer, Y, M Jackermeier, A Abate, and DA Parker. 2026. “Probabilistic Performance Guarantees for Multi-Task Reinforcement Learning.” In 43rd International Conference on Machine Learning (ICML'26).
Print

Authors

+ Schnitzer, Y More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Role:: Author
ORCID:: 0000-0001-7406-3440

+ Jackermeier, M More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Role:: Author

+ Abate, A More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Role:: Author

+ Parker, DA More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Role:: Author

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: https://ror.org/0439y7842
Grant:: EP/Y028872/1

Host title:: Proceedings of the 43rd International Conference on Machine Learning (PMLR 306)
Acceptance date:: 2026-04-30
Event title:: 43rd International Conference on Machine Learning (ICML'26)
Event location:: Seoul, South Korea
Event website:: https://icml.cc/
Event start date:: 2026-07-06
Event end date:: 2026-07-11

Language:: English
Pubs id:: 2431515
Local pid:: pubs:2431515
Deposit date:: 2026-06-09
ARK identifier:: ark:/29072/ora_75a4ea54c27d443b95d95f00552de54d

Terms of use

Copyright date:: 2026
Rights statement:: Copyright 2026 by the author(s).

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP