Thesis
Safe learning in humans and machines
- Abstract:
-
Intelligent agents, biological or artificial, face a fundamental dilemma: how to learn safely from experience when learning inevitably involves making mistakes. This requires the ability to explore environments with caution during learning (i.e., safe exploration) and to infer deviations from homeostatic grace, such as injury in animals or faults in robots, to reorganise behaviour appropriately (i.e., self-preservation).
This work first explores safe exploration through strategies that combine multiple value functions. A critical safety-efficiency trade-off is identified, arising from the conflict between instrumental control, which learns the consequences of actions, and learned defensive reflexes such as Pavlovian biases to withdraw from aversive stimuli. It is hypothesised that this trade-off can be resolved by gating Pavlovian avoidance based on outcome uncertainty, and a basic test is provided in a human approach–withdrawal virtual reality experiment. Noting the suboptimality underlying Pavlovian misbehaviour, the thesis subsequently proposes a mechanism by which the dopaminergic system could optimally compose multiple values to support efficient, safe, and stable learning.
Shifting focus from external threats to bodily integrity, the thesis next addresses the problem of self-preservation, with particular emphasis on the computational representation of injury. Post-injury homeostasis is modelled as a partially observable Markov decision process (POMDP), explaining counterintuitive behaviours such as investigating an injury despite immediate pain. This framework is used to mathematically formalise an information-restriction model of pain chronification, providing a quantitative complement to the Fear-Avoidance model. These concepts are then extended to machines: robots performing stereotypical movements can employ self-supervised learning and local learning rules to build internal models of expected sensorimotor experience, enabling fault detection and adaptive responses to unexpected deviations.
Together, this work advances the understanding of safe learning, a challenge shared by humans and machines, with implications for understanding post-injury transitions to chronic pain and the development of neuro-inspired safe AI.
Actions
Access Document
- Files:
-
-
(Preview, Dissemination version, pdf, 45.8MB, Terms of use)
-
Authors
Contributors
+ Seymour, B
- Institution:
- University of Oxford
- Division:
- MSD
- Department:
- Clinical Neurosciences
- Oxford college:
- Linacre College
- Role:
- Supervisor
- ORCID:
- 0000-0003-1724-5832
+ Havoutis, I
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Engineering Science
- Oxford college:
- Reuben College
- Role:
- Supervisor
- ORCID:
- 0000-0002-4371-4623
+ Bogacz, R
- Institution:
- University of Oxford
- Division:
- MSD
- Department:
- Clinical Neurosciences
- Oxford college:
- Linacre College
- Role:
- Examiner
+ Fardo, F
- Institution:
- Aarhus University
- Role:
- Examiner
+ Wellcome Trust
More from this funder
- Funder identifier:
- https://ror.org/029chgv08
- Grant:
- 214251/Z/18/Z
+ NIHR Oxford Musculoskeletal Biomedical Research Centre
More from this funder
- Funder identifier:
- https://ror.org/00aps1a34
- Grant:
- NIHR203316
+ Institute of Information & Communications Technology Planning & Evaluation
More from this funder
- Funder identifier:
- https://ror.org/01g0hqq23
- Grant:
- MSIT 2019-0-01371
+ Japan Society for the Promotion of Science
More from this funder
- Funder identifier:
- https://ror.org/00hhkn466
- Grant:
- 22H04998
- DOI:
- Type of award:
- DPhil
- Awarding institution:
- University of Oxford
- Language:
-
English
- Keywords:
- Subjects:
- Pubs id:
-
2407641
- Local pid:
-
pubs:2407641
- Deposit date:
-
2026-03-10
- ARK identifier:
Terms of use
- Copyright holder:
- Pranav Mahajan
- Copyright date:
- 2025
- Notes:
- Homeostasis after injury: how intertwined inference and control underpin post-injury pain and behaviour, Neural associative skill memories for safer robotics and modeling human sensorimotor repertoires, Balancing safety and efficiency in human decision-making, and Composing the value signal for dopamine-mediated learning are derived from this thesis.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record