Conference item
Unlocking in-context learning for natural datasets beyond language modelling
- Abstract:
- Large Language Models (LLMs) exhibit In-Context Learning (ICL), which enables the model to perform new tasks conditioning only on the examples provided in the context without updating the model’s weights. While ICL offers fast adaptation across natural language tasks and domains, its emergence is less straightforward for modalities beyond text. In this work, we systematically uncover properties present in LLMs that support the emergence of ICL for autoregressive models and various modalities by promoting the learning of the needed mechanisms for ICL. We identify exact token repetitions in the training data sequences as an important factor for ICL. Such repetitions further improve stability and reduce transiency in ICL performance. Moreover, we emphasise the significance of training task difficulty for the emergence of ICL. Finally, by applying our novel insights on ICL emergence, we unlock ICL capabilities for various visual datasets and a more challenging EEG classification task. Code is available at https://github.com/jelenab98/unlocking_icl.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 7.1MB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-032-12840-9_20
Authors
+ German Research Foundation
More from this funder
- Funder identifier:
- https://ror.org/018mejw64
- Grant:
- 499552394
- Publisher:
- Springer
- Host title:
- Pattern Recognition
- Pages:
- 303-319
- Series:
- Lecture Notes in Computer Science
- Series number:
- 16125
- Publication date:
- 2026-01-02
- Acceptance date:
- 2025-07-16
- Event title:
- DAGM German Conference on Pattern Recognition (GCPR 2025)
- Event location:
- Freiburg, Germany
- Event website:
- https://www.dagm-gcpr.de/year/2025
- Event start date:
- 2025-09-23
- Event end date:
- 2025-09-26
- DOI:
- EISSN:
-
1611-3349
- ISSN:
-
0302-9743
- EISBN:
- 9783032128409
- ISBN:
- 9783032128393
- Language:
-
English
- Keywords:
- Pubs id:
-
2366260
- Local pid:
-
pubs:2366260
- Deposit date:
-
2026-03-19
- ARK identifier:
Terms of use
- Copyright holder:
- Bratulić et al
- Copyright date:
- 2026
- Rights statement:
- © 2026 The Author(s), under exclusive license to Springer Nature Switzerland AG
- Notes:
- The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record