Journal article
Road rage against the machine: humans and LLMs share a blame bias against driverless cars
- Abstract:
- Human language reflects our social values, biases, and moral judgments. Large language models (LLMs) trained on extensive human texts may therefore learn or encode such information, allowing them to generate responses within moral and ethical domains. Investigating whether LLMs exhibit human-like (including potentially biased or skewed) moral judgments is therefore crucial. Recent moral psychology research suggests that humans tend to have stronger negative reactions toward, and attribute more blame to, intelligent autonomous machines than to fellow humans for identical harm. Here we examine whether LLMs (OpenAI’s GPT-3.5 and GPT-4) exhibit a similar bias against machines in the specific domain of driverless cars. We replicate experiments from two previous studies in the USA and China and find that GPT-4 (but not GPT-3.5), similar to human participants reported previously, consistently rates machine drivers as more blameworthy and causally responsible than human drivers for identical traffic harm (Study 1), while also rating machine versus human drivers’ identical actions as more harmful and morally wrong (preregistered Study 2). This asymmetry in moral judgments is replicated across both LLMs and human participants in a new crash scenario that is unlikely to have been included in the LLMs’ training sets (preregistered Study 3). We discuss whether the blame bias against machines might be morally justified, and also propose that its presence in humans and LLMs could be due to different mechanisms.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 1.4MB, Terms of use)
-
- Publisher copy:
- 10.1080/10447318.2025.2526593
Authors
+ National Research Foundation
More from this funder
- Funder identifier:
- https://ror.org/03cpyc314
- Grant:
- AISG3-GV-2023-012
- Programme:
- AI Singapore Programme
+ Wellcome Trust
More from this funder
- Funder identifier:
- https://ror.org/029chgv08
- Grant:
- 226801/Z/22/Z
+ Zhejiang University
More from this funder
- Funder identifier:
- https://ror.org/00a2xv884
- Programme:
- Qiushi Program
- Publisher:
- Taylor and Francis Group
- Journal:
- International Journal of Human-Computer Interaction More from this journal
- Volume:
- 42
- Issue:
- 4
- Pages:
- 2121–2131
- Publication date:
- 2025-07-08
- Acceptance date:
- 2025-06-24
- DOI:
- EISSN:
-
1532-7590
- ISSN:
-
1044-7318
- Language:
-
English
- Keywords:
- Pubs id:
-
2244299
- Local pid:
-
pubs:2244299
- Deposit date:
-
2026-04-22
- ARK identifier:
Terms of use
- Copyright holder:
- Taylor & Francis Group, LLC
- Copyright date:
- 2025
- Rights statement:
- ©2025 Taylor & Francis Group, LLC
- Notes:
- The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record