TACO: Learning task decomposition via temporal alignment for control

Shiarlis, K; Wulfmeier, M; Salter, S; Whiteson, S; Posner, H

AI Collection

Conference item

TACO: Learning task decomposition via temporal alignment for control

Abstract:: Many advanced Learning from Demonstration (LfD) methods consider the decomposition of complex, real-world tasks into simpler sub-tasks. By reusing the corresponding sub-policies within and between tasks, they provide training data for each policy from different high-level tasks and compose them to perform novel ones. However, most existing approaches to modular LfD focus either on learning a single high-level task or depend on domain knowledge and temporal segmentation. By contrast, we propose a weakly supervised, domain-agnostic approach based on task sketches, which include only the sequence of sub-tasks performed in each demonstration. Our approach simultaneously aligns the sketches with the observed demonstrations and learns the required sub-policies, which improves generalisation in comparison to separate optimisation procedures. We evaluate the approach on multiple domains, including a simulated 3D robot arm control task using purely image-based observations. The approach performs commensurately with fully supervised approaches, while requiring significantly less annotation effort, and significantly outperforms methods which separate segmentation and imitation.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Shiarlis, K., Wulfmeier, M., Salter, S., Whiteson, S., & Posner, H. (2018). TACO: Learning task decomposition via temporal alignment for control.

MLA Style

Shiarlis, K, et al. “TACO: Learning Task Decomposition via Temporal Alignment for Control.” 2018.

Chicago Style

Shiarlis, K, M Wulfmeier, S Salter, S Whiteson, and H Posner. 2018. “TACO: Learning Task Decomposition via Temporal Alignment for Control.”
Print

Access Document

Files:: Whiteson et al, TACO - Learning task decomposition via tempora...

(Preview, Accepted manuscript, pdf, 3.1MB, Terms of use)

Authors

+ Shiarlis, K More by this author

Role:: Author

+ Wulfmeier, M More by this author

Institution:: University of Oxford
Division:: MPLS Division
Department:: Engineering Science
Role:: Author

+ Salter, S More by this author

Institution:: University of Oxford
Division:: MPLS Division
Department:: Engineering Science
Role:: Author

+ Whiteson, S More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Oxford college:: St Catherine's College
Role:: Author

+ Posner, H More by this author

Institution:: University of Oxford
Division:: MPLS Division
Department:: Engineering Science
Role:: Author

+ Seventh Framework Programme More from this funder

Grant:: 611153

+ Engineering and Physical Sciences Research Council More from this funder

Grant:: Studentship

Publisher:: Journal of Machine Learning Research
Host title:: International Conference on Machine Learning
Journal:: Thirty-fifth International Conference on Machine Learning (ICML 2018) More from this journal
Publication date:: 2018-07-03
Acceptance date:: 2018-06-12

Pubs id:: pubs:857022
UUID:: uuid:db521575-3720-4091-94e7-e6a5da1fedb5
Local pid:: pubs:857022
Source identifiers:: 857022
Deposit date:: 2018-06-12
ARK identifier:: ark:/29072/ora_db5215753720409194e7e6a5da1fedb5

Terms of use

Copyright holder:: Whiteson et al
Notes:: Copyright 2018 by the author(s). This is the accepted manuscript version of the article. The final version is available online from Journal of Machine Learning Research at: http://proceedings.mlr.press/v80/shiarlis18a.html

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

TACO: Learning task decomposition via temporal alignment for control

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

TACO: Learning task decomposition via temporal alignment for control

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions