Thesis icon

Thesis

Safe learning in humans and machines

Abstract:
Intelligent agents, biological or artificial, face a fundamental dilemma: how to learn safely from experience when learning inevitably involves making mistakes. This requires the ability to explore environments with caution during learning (i.e., safe exploration) and to infer deviations from homeostatic grace, such as injury in animals or faults in robots, to reorganise behaviour appropriately (i.e., self-preservation).

This work first explores safe exploration through strategies that combine multiple value functions. A critical safety-efficiency trade-off is identified, arising from the conflict between instrumental control, which learns the consequences of actions, and learned defensive reflexes such as Pavlovian biases to withdraw from aversive stimuli. It is hypothesised that this trade-off can be resolved by gating Pavlovian avoidance based on outcome uncertainty, and a basic test is provided in a human approach–withdrawal virtual reality experiment. Noting the suboptimality underlying Pavlovian misbehaviour, the thesis subsequently proposes a mechanism by which the dopaminergic system could optimally compose multiple values to support efficient, safe, and stable learning.

Shifting focus from external threats to bodily integrity, the thesis next addresses the problem of self-preservation, with particular emphasis on the computational representation of injury. Post-injury homeostasis is modelled as a partially observable Markov decision process (POMDP), explaining counterintuitive behaviours such as investigating an injury despite immediate pain. This framework is used to mathematically formalise an information-restriction model of pain chronification, providing a quantitative complement to the Fear-Avoidance model. These concepts are then extended to machines: robots performing stereotypical movements can employ self-supervised learning and local learning rules to build internal models of expected sensorimotor experience, enabling fault detection and adaptive responses to unexpected deviations.

Together, this work advances the understanding of safe learning, a challenge shared by humans and machines, with implications for understanding post-injury transitions to chronic pain and the development of neuro-inspired safe AI.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Clinical Neurosciences
Oxford college:
Linacre College
Role:
Author
ORCID:
https://orcid.org/0009-0001-2507-545

Contributors

Institution:
University of Oxford
Division:
MSD
Department:
Clinical Neurosciences
Oxford college:
Linacre College
Role:
Supervisor
ORCID:
0000-0003-1724-5832
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Reuben College
Role:
Supervisor
ORCID:
0000-0002-4371-4623
Institution:
University of Oxford
Division:
MSD
Department:
Clinical Neurosciences
Oxford college:
Linacre College
Role:
Examiner
Institution:
Aarhus University
Role:
Examiner


More from this funder
Funder identifier:
https://ror.org/029chgv08
Grant:
214251/Z/18/Z
More from this funder
Funder identifier:
https://ror.org/00aps1a34
Grant:
NIHR203316
More from this funder
Funder identifier:
https://ror.org/01g0hqq23
Grant:
MSIT 2019-0-01371
More from this funder
Funder identifier:
https://ror.org/00hhkn466
Grant:
22H04998


DOI:
Type of award:
DPhil
Awarding institution:
University of Oxford



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP