Probabilistic machine learning: methods and applications to continuous control

Hasenclever, L

Thesis

Probabilistic machine learning: methods and applications to continuous control

Abstract:: Probabilistic inference is at the core of many recent advances in machine learning. Unfortunately, exact inference is intractable in all but the simplest models. Thus, approximate inference methods are required use probabilistic methods for large and complex models. Broadly speaking, there are two different paradigms for approximate inference, sampling methods and variational methods. Sampling methods attempt to construct approximate samples from the target distribution, which can then be used to approximate expectations with respect to the posterior. Variational methods instead rephrase inference as an optimisation problem and form parametric approximations to the target distribution.

In this thesis, we present contributions to sampling methods and variational methods with a focus on scalability. Firstly, we introduce a novel sampling technique based on Hamiltonian Monte Carlo that uses a relativistic kinetic energy to improve robustness to hyperparameters. We then describe a novel algorithm for distributed Bayesian learning based on expectation propagation techniques. In addition, we present a novel normalising flow that can be used to form more flexible variational approximations within variational inference.

We then describe two applications of probabilistic thinking and variational techniques to the field a continuous control. Firstly, we describe how reinforcement learning can be viewed as probabilistic inference and introduce a novel algorithm for learning priors in reinforcement learning leading to substantial improvement in learning speed and final performance in certain settings. Lastly, we describe a probabilistic model that can be used to compress thousands of expert policies trained to reproduce motion capture data into one model that is capable of one-shot imitation. We further demonstrate that it is possible to reuse our model, resulting in naturalistic movements on challenging control tasks.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Hasenclever, L. (2018). Probabilistic machine learning: methods and applications to continuous control [PhD thesis]. University of Oxford.

MLA Style

Hasenclever, L. Probabilistic Machine Learning: Methods and Applications to Continuous Control. 2018. University of Oxford, PhD thesis.

Chicago Style

Hasenclever, L. 2018. “Probabilistic Machine Learning: Methods and Applications to Continuous Control.” PhD thesis, University of Oxford.
Print

Access Document

Files:: thesis_final.pdf

(Preview, pdf, 13.3MB, Terms of use)

Authors

+ Hasenclever, L More by this author

Division:: MPLS
Department:: Statistics
Role:: Author

Contributors

+ Teh, Y

Role:: Supervisor

+ Engineering & Physical Sciences Research Council More from this funder

Grant:: EP/L016710/1

DOI:: 10.5287/ora-wxo6xnexp
Type of award:: DPhil
Level of award:: Doctoral
Awarding institution:: University of Oxford

Keywords:: Machine Learning

Continuous Control
UUID:: uuid:d5fdd706-38d0-4c3c-bab3-1696b012b781
Deposit date:: 2020-01-19
ARK identifier:: ark:/29072/ora_d5fdd70638d04c3cbab31696b012b781

Terms of use

Copyright holder:: Hasenclever, L

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Thesis

Probabilistic machine learning: methods and applications to continuous control

Actions

Access Document

Authors

Contributors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Thesis

Probabilistic machine learning: methods and applications to continuous control

Actions

Access Document

Authors

Contributors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions