Journal article

Road rage against the machine: humans and LLMs share a blame bias against driverless cars

Abstract:
Human language reflects our social values, biases, and moral judgments. Large language models (LLMs) trained on extensive human texts may therefore learn or encode such information, allowing them to generate responses within moral and ethical domains. Investigating whether LLMs exhibit human-like (including potentially biased or skewed) moral judgments is therefore crucial. Recent moral psychology research suggests that humans tend to have stronger negative reactions toward, and attribute more blame to, intelligent autonomous machines than to fellow humans for identical harm. Here we examine whether LLMs (OpenAI’s GPT-3.5 and GPT-4) exhibit a similar bias against machines in the specific domain of driverless cars. We replicate experiments from two previous studies in the USA and China and find that GPT-4 (but not GPT-3.5), similar to human participants reported previously, consistently rates machine drivers as more blameworthy and causally responsible than human drivers for identical traffic harm (Study 1), while also rating machine versus human drivers’ identical actions as more harmful and morally wrong (preregistered Study 2). This asymmetry in moral judgments is replicated across both LLMs and human participants in a new crash scenario that is unlikely to have been included in the LLMs’ training sets (preregistered Study 3). We discuss whether the blame bias against machines might be morally justified, and also propose that its presence in humans and LLMs could be due to different mechanisms.
Publication status:
Published
Peer review status:
Peer reviewed

Access Document

Files:
Publisher copy:
10.1080/10447318.2025.2526593

Authors

Institution:
University of Oxford
Division:
HUMS
Department:
Uehiro Institute
Oxford college:
St Cross College
Role:
Author
ORCID:
0000-0003-1691-6403
Institution:
University of Oxford
Division:
HUMS
Department:
Uehiro Institute
Role:
Author
ORCID:
0000-0001-9691-2888


Funder identifier:
https://ror.org/03cpyc314
Grant:
AISG3-GV-2023-012
Programme:
AI Singapore Programme
Funder identifier:
https://ror.org/029chgv08
Grant:
226801/Z/22/Z
Funder identifier:
https://ror.org/00a2xv884
Programme:
Qiushi Program


Publisher:
Taylor and Francis Group
Journal:
International Journal of Human-Computer Interaction
Volume:
42
Issue:
4
Pages:
2121–2131
Publication date:
2025-07-08
Acceptance date:
2025-06-24
DOI:
10.1080/10447318.2025.2526593
EISSN:
1532-7590
ISSN:
1044-7318


Language:
English
Pubs id:
2244299
Local pid:
pubs:2244299
Deposit date:
2026-04-22