Augmented world models facilitate zero-shot dynamics generalization from a single offline environment

Ball, PJ; Lu, C; Parker-Holder, J; Roberts, S

AI Collection

Conference item

Augmented world models facilitate zero-shot dynamics generalization from a single offline environment

Abstract:: Reinforcement learning from large-scale offline datasets provides us with the ability to learn policies without potentially unsafe or impractical exploration. Significant progress has been made in the past few years in dealing with the challenge of correcting for differing behavior between the data collection and learned policies. However, little attention has been paid to potentially changing dynamics when transferring a policy to the online setting, where performance can be up to 90% reduced for existing methods. In this paper we address this problem with Augmented World Models (AugWM). We augment a learned dynamics model with simple transformations that seek to capture potential changes in physical properties of the robot, leading to more robust policies. We not only train our policy in this new setting, but also provide it with the sampled augmentation as a context, allowing it to adapt to changes in the environment. At test time we learn the context in a self-supervised fashion by approximating the augmentation which corresponds to the new environment. We rigorously evaluate our approach on over 100 different changed dynamics settings, and show that this simple approach can significantly improve the zero-shot generalization of a recent state-of-the-art baseline, often achieving successful policies where the baseline fails.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Ball, P. J., Lu, C., Parker-Holder, J., & Roberts, S. (2021). Augmented world models facilitate zero-shot dynamics generalization from a single offline environment. 38th International Conference on Machine Learning (ICML 2021), 619–629.

MLA Style

Ball, PJ, et al. “Augmented World Models Facilitate Zero-Shot Dynamics Generalization from a Single Offline Environment.” 38th International Conference on Machine Learning (ICML 2021), Proceedings of Machine Learning Research, 2021, pp. 619–29.

Chicago Style

Ball, PJ, C Lu, J Parker-Holder, and S Roberts. 2021. “Augmented World Models Facilitate Zero-Shot Dynamics Generalization from a Single Offline Environment.” In 38th International Conference on Machine Learning (ICML 2021), 619–29. Proceedings of Machine Learning Research. PMLR.
Print

Access Document

Files:: Ball_et_al_2021_Augmented_world_models.pdf

(Preview, Version of record, pdf, 1.6MB, Terms of use)

Publication website:: https://proceedings.mlr.press/v139/ball21a.html

Authors

+ Ball, PJ More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Lu, C More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Parker-Holder, J More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Roberts, S More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author
ORCID:: 0000-0002-9305-9268

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: https://ror.org/0439y7842
Funding agency for:: Lu, C

+ Willowgrove Studentship More from this funder

Funding agency for:: Ball, PJ

Publisher:: PMLR
Host title:: Proceedings of the 38th International Conference on Machine Learning
Pages:: 619-629
Series:: Proceedings of Machine Learning Research
Series number:: 139
Publication date:: 2021-07-07
Acceptance date:: 2021-05-08
Event title:: 38th International Conference on Machine Learning (ICML 2021)
Event location:: Virtual event
Event website:: https://icml.cc/Conferences/2021
Event start date:: 2021-07-18
Event end date:: 2021-07-24
EISSN:: 2640-3498
ISSN:: 2640-3498

Language:: English
Pubs id:: 1197231
Local pid:: pubs:1197231
Deposit date:: 2025-02-18
ARK identifier:: ark:/29072/ora_0503b36c4d074a77816e2f7ae2b6f17b

Terms of use

Copyright holder:: Ball et al.
Rights statement:: Copyright 2021 by the author(s). This is an open access article under the CC-BY license.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

Augmented world models facilitate zero-shot dynamics generalization from a single offline environment

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

Augmented world models facilitate zero-shot dynamics generalization from a single offline environment

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions