Conference item icon

Conference item

Formalising human-in-the-loop: computational reductions, failure modes, and legal-moral responsibility

Abstract:
We use the notion of oracle machines and reductions from computability theory to formalise different Human-in-the-loop (HITL) setups for AI systems, distinguishing between trivial human monitoring (i.e., total functions), single endpoint human action (i.e., many-one reductions), and highly involved human-AI interaction (i.e., Turing reductions). We then proceed to show that the legal status and safety of different setups vary greatly. We present a taxonomy to categorise HITL failure modes, highlighting the practical limitations of HITL setups. We then identify omissions in UK and EU legal frameworks, which focus on HITL setups that may not always achieve the desired ethical, legal, and sociotechnical outcomes. We suggest areas where the law should recognise the effectiveness of different HITL setups and assign responsibility in these contexts, avoiding human `scapegoating'. Our work shows an unavoidable trade-off between attribution of legal responsibility, and technical explainability. Overall, we show how HITL setups involve many technical design decisions, and can be prone to failures out of the humans' control. Our formalisation and taxonomy opens up a new analytic perspective on the challenges in creating HITL setups, helping inform AI developers and lawmakers on designing HITL setups to better achieve their desired outcomes.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publication website:
https://openreview.net/forum?id=KR8viVTrX4

Authors

More by this author
Institution:
University of Oxford
Division:
SSD
Department:
Law
Oxford college:
Somerville College
Role:
Author
ORCID:
0009-0005-5289-9219


Publisher:
OpenReview
Host title:
Proceedings of the 14th International Conference on Learning Representations (ICLR 2026)
Article number:
8021
Publication date:
2026-01-26
Acceptance date:
2026-01-26
Event title:
14th International Conference on Learning Representations (ICLR 2026)
Event location:
Rio de Janeiro, Brazil
Event website:
https://iclr.cc/Conferences/2026
Event start date:
2026-04-23
Event end date:
2026-04-27


Language:
English
Keywords:
Pubs id:
2421974
Local pid:
pubs:2421974
Source identifiers:
W4416829997
Deposit date:
2026-06-06
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP