Large language models are fixated by red herrings: exploring creative problem solving and einstellung effect using the only connect wall dataset

Naeini, SA; Saqur, R; Saeidi, M; Giorgi, J; Taati, B

Conference item

Large language models are fixated by red herrings: exploring creative problem solving and einstellung effect using the only connect wall dataset

Abstract:: The quest for human imitative AI has been an enduring topic in AI research since its inception. The technical evolution and emerging capabilities of the latest cohort of large language models (LLMs) have reinvigorated the subject beyond academia to the cultural zeitgeist. While recent NLP evaluation benchmark tasks test some aspects of human-imitative behavior (e.g., BIG-bench's 'human-like behavior' tasks), few, if not none, examine creative problem solving abilities. Creative problem solving in humans is a well-studied topic in cognitive neuroscience with standardized tests that predominantly use the ability to associate (heterogeneous) connections among clue words as a metric for creativity. Exposure to misleading stimuli - distractors dubbed red herrings - impede human performance in such tasks via the fixation effect and Einstellung paradigm. In cognitive neuroscience studies, such fixations are experimentally induced by pre-exposing participants to orthographically similar incorrect words to subsequent word-fragments or clues. The popular British quiz show Only Connect's Connecting Wall segment essentially mimics Mednick's Remote Associates Test (RAT) formulation with built-in, deliberate red herrings, which makes it an ideal proxy task to explore and study the fixation effect and Einstellung paradigm from cognitive neuroscience in LLMs. In this paper, we present the novel Only Connect Wall (OCW) dataset and report results from our evaluation of selected pre-trained language models and LLMs on creative problem solving tasks like grouping clue words by heterogeneous connections and identifying correct open knowledge domain connections in respective groups. We synthetically generate two additional datasets: OCW-Randomized, OCW-WordNet to further analyze our red-herrings hypothesis in language models. The code and link to the dataset are available at https://github.com/TaatiTeam/OCW.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Naeini, S. A., Saqur, R., Saeidi, M., Giorgi, J., & Taati, B. (2023). Large language models are fixated by red herrings: exploring creative problem solving and einstellung effect using the only connect wall dataset. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 36, 5631–5652.

MLA Style

Naeini, SA, et al. “Large Language Models Are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect Using the Only Connect Wall Dataset.” 37th Conference on Neural Information Processing Systems (NeurIPS 2023), vol. 36, 2023, pp. 5631–52.

Chicago Style

Naeini, SA, R Saqur, M Saeidi, J Giorgi, and B Taati. 2023. “Large Language Models Are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect Using the Only Connect Wall Dataset.” In 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 36:5631–52. Curran Associates.
Print

Access Document

Files:: Naeini_et_al_2023_Large_language_models.pdf

(Preview, Accepted manuscript, pdf, 1.1MB, Terms of use)

Publisher copy:: 10.52202/075280-0246

Authors

+ Naeini, SA More by this author

Role:: Author

+ Saqur, R More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Mathematical Institute
Role:: Author
ORCID:: 0000-0002-6330-5480

+ Saeidi, M More by this author

Role:: Author

+ Giorgi, J More by this author

Role:: Author

+ Taati, B More by this author

Role:: Author

Publisher:: Curran Associates
Host title:: Advances in Neural Information Processing Systems 36 (NeurIPS 2023)
Volume:: 36
Pages:: 5631-5652
Publication date:: 2023-12-15
Acceptance date:: 2023-09-21
Event title:: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Event location:: New Orleans, USA
Event website:: https://neurips.cc/Conferences/2023
Event start date:: 2023-12-10
Event end date:: 2023-12-16
DOI:: 10.52202/075280-0246
EISBN:: 9781713899921

Language:: English
Pubs id:: 2366483
Local pid:: pubs:2366483
Deposit date:: 2026-04-02
ARK identifier:: ark:/29072/ora_c48549df2fa040b097da41ce193b158d

Terms of use

Copyright holder:: Naeini et al. and NeurIPS
Notes:: This is the accepted manuscript version of the article. The final version is available online from the Neural Information Processing Systems Foundation at: https://proceedings.neurips.cc/paper_files/paper/2023/hash/11e3e0f1b29dcd31bd0952bfc1357f68-Abstract-Datasets_and_Benchmarks.html

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

Large language models are fixated by red herrings: exploring creative problem solving and einstellung effect using the only connect wall dataset

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

Large language models are fixated by red herrings: exploring creative problem solving and einstellung effect using the only connect wall dataset

Actions

Access Document

Authors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions