Pessimistic Bayesianism for conservative optimization and imitation

Cohen, M

Thesis

Pessimistic Bayesianism for conservative optimization and imitation

Abstract:: Subject to several assumptions, sufficiently advanced reinforcement learners would likely face an incentive and likely have an ability to intervene in the provision of their reward, with catastrophic consequences. In this thesis, I develop a theory of pessimism and show how it can produce safe advanced artificial agents. Not only do I demonstrate that the assumptions mentioned above can be avoided; I prove theorems which demonstrate safety. First, I develop an idealized pessimistic reinforcement learner. For any given novel event that a mentor would never cause, a sufficiently pessimistic reinforcement learner trained with the help of that mentor would probably avoid causing it. This result is without precedent in the literature. Next, on similar principles, I develop an idealized pessimistic imitation learner. If the probability of an event when the demonstrator acts can be bounded above, then the probability can be bounded above when the imitator acts instead; this kind of result is unprecedented when the imitator learns online and the environment never resets. In an environment that never resets, no one has previously demonstrated, to my knowledge, that an imitation learner even exists. Finally, both of the agents above demand more efficient algorithms for high-quality uncertainty quantification, so I have developed a new kernel for Gaussian process modelling that allows for log-linear time complexity and linear space complexity, instead of a naïve cubic time complexity and quadratic space complexity. This is not the first Gaussian process with this time complexity—inducing points methods have linear complexity—but we do outperform such methods significantly on regression benchmarks, as one might expect given the much higher dimensionality of our kernel. This thesis shows the viability of pessimism with respect to well-quantified epistemic uncertainty as a path to safe artificial agency.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Cohen, M. (2023). Pessimistic Bayesianism for conservative optimization and imitation [PhD thesis]. University of Oxford.

MLA Style

Cohen, M. Pessimistic Bayesianism for Conservative Optimization and Imitation. 2023. University of Oxford, PhD thesis.

Chicago Style

Cohen, M. 2023. “Pessimistic Bayesianism for Conservative Optimization and Imitation.” PhD thesis, University of Oxford.
Print

Access Document

Files:: Cohen_2023_Pessimistic_Bayesianism.pdf

(Preview, Dissemination version, pdf, 4.5MB, Terms of use)

Authors

+ Cohen, M More by this author

Division:: MPLS
Department:: Engineering Science
Role:: Author

Contributors

+ Osborne, M

Role:: Supervisor

DOI:: 10.5287/ora-ag19naqn2
Type of award:: DPhil
Level of award:: Doctoral
Awarding institution:: University of Oxford

Language:: English
Keywords:: Gaussian Processes

Bayesian Inference

AI Safety
Subjects:: Artificial intelligence
Deposit date:: 2023-07-21
ARK identifier:: ark:/29072/ora_1abdcbb591764aad986dcab3e9fe7f6d

Terms of use

Copyright holder:: Cohen, M

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Thesis

Pessimistic Bayesianism for conservative optimization and imitation

Actions

Access Document

Authors

Contributors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Thesis

Pessimistic Bayesianism for conservative optimization and imitation

Actions

Access Document

Authors

Contributors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions