Derivational morphology reveals analogical generalization in large language models

Hofmann, V; Weissweiler, L; Mortensen, DR; Schütze, H; Pierrehumbert, JB

AI Collection

Journal article

Derivational morphology reveals analogical generalization in large language models

Abstract:: What mechanisms underlie linguistic generalization in large language models (LLMs)? This question has attracted considerable attention, with most studies analyzing the extent to which the language skills of LLMs resemble rules. As of yet, it is not known whether linguistic generalization in LLMs could equally well be explained as the result of analogy. A key shortcoming of prior research is its focus on regular linguistic phenomena, for which rule-based and analogical approaches make the same predictions. Here, we instead examine derivational morphology, specifically English adjective nominalization, which displays notable variability. We introduce a method for investigating linguistic generalization in LLMs: Focusing on GPT-J, we fit cognitive models that instantiate rule-based and analogical learning to the LLM training data and compare their predictions on a set of nonce adjectives with those of the LLM, allowing us to draw direct conclusions regarding underlying mechanisms. As expected, rule-based and analogical models explain the predictions of GPT-J equally well for adjectives with regular nominalization patterns. However, for adjectives with variable nominalization patterns, the analogical model provides a much better match. Furthermore, GPT-J's behavior is sensitive to the individual word frequencies, even for regular forms, a behavior that is consistent with an analogical account but not a rule-based one. These findings refute the hypothesis that GPT-J's linguistic generalization on adjective nominalization involves rules, suggesting analogy as the underlying mechanism. Overall, our study suggests that analogical processes play a bigger role in the linguistic generalization of LLMs than previously thought.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Hofmann, V., Weissweiler, L., Mortensen, D. R., Schütze, H., & Pierrehumbert, J. B. (2025). Derivational morphology reveals analogical generalization in large language models. Proceedings of the National Academy of Sciences, 122(19).

MLA Style

Hofmann, V, et al. “Derivational Morphology Reveals Analogical Generalization in Large Language Models.” Proceedings of the National Academy of Sciences, vol. 122, no. 19, 2025.

Chicago Style

Hofmann, V, L Weissweiler, DR Mortensen, H Schütze, and JB Pierrehumbert. 2025. “Derivational Morphology Reveals Analogical Generalization in Large Language Models.” Proceedings of the National Academy of Sciences 122 (19).
Print

Access Document

Files:: Hofmann_et_al_2025_Derivational_morphology_reveals.pdf

(Preview, Version of record, eps, 1.4MB, Terms of use)

Publisher copy:: 10.1073/pnas.2423232122

Authors

+ Hofmann, V More by this author

Role:: Author

+ Weissweiler, L More by this author

Role:: Author

+ Mortensen, DR More by this author

Role:: Author

+ Schütze, H More by this author

Role:: Author

+ Pierrehumbert, JB More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Sub department:: Oxford e-Research Centre
Role:: Author
ORCID:: 0000-0002-5989-3574

+ European Research Council More from this funder

Funder identifier:: https://ror.org/0472cxd90
Grant:: 740516

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: https://ror.org/0439y7842
Grant:: EP/T023333/1

Publisher:: National Academy of Sciences
Journal:: Proceedings of the National Academy of Sciences More from this journal
Volume:: 122
Issue:: 19
Article number:: e2423232122
Publication date:: 2025-05-09
Acceptance date:: 2025-02-18
DOI:: 10.1073/pnas.2423232122
EISSN:: 1091-6490
ISSN:: 0027-8424

Language:: English
Keywords:: large language models

lexicon

analogy

linguistic rules

AI
Pubs id:: 2124012
Local pid:: pubs:2124012
Deposit date:: 2025-05-15
ARK identifier:: ark:/29072/ora_a9edfd1102444b58aa6938e5da2a3bef

Terms of use

Copyright holder:: Hofmann et al

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Journal article

Derivational morphology reveals analogical generalization in large language models

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Journal article

Derivational morphology reveals analogical generalization in large language models

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions