When do prompting and prefix-tuning work? A theory of capabilities and limitations

Petrov, A; Torr, P; Bibi, A

Conference item

Abstract:: Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity because they can often match the performance of full fine-tuning with a fraction of the parameters. Despite their empirical successes, there is little theoretical understanding of how these techniques influence the internal computation of the model or of the limits on their expressiveness. We show that despite the continuous embedding space being more expressive than the discrete token space, soft prompting and prefix-tuning are strictly less expressive than full fine-tuning, even with the same number of learnable parameters. Concretely, context-based fine-tuning cannot change the relative attention pattern over the content and can only bias the outputs of an attention layer in a fixed direction. This suggests that while techniques like prompting, in-context learning, soft prompting, and prefix-tuning can effectively elicit skills present in the pretrained model, they cannot learn novel tasks that require new attention patterns.
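
The central claim of the abstract can be checked numerically for a single attention head. The following is a minimal sketch, not the authors' code: random vectors stand in for the projected query, keys, and values, the prefix is a single hypothetical key/value pair, and all names (`attend`, `prefix_key`, `prefix_value`, `alpha`) are illustrative assumptions. It compares attention with and without the prefix and checks that the renormalised weights over the content tokens are unchanged, and that the new output equals the original output scaled by `1 - alpha` plus `alpha` times the prefix value.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(query, keys, values):
    """Single-head scaled dot-product attention for one query.

    Returns the attention weights and the weighted sum of the values.
    """
    scale = 1.0 / math.sqrt(len(query))
    weights = softmax([scale * dot(query, k) for k in keys])
    dim = len(values[0])
    output = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return weights, output


rng = random.Random(0)
dim, n_content = 4, 5


def rand_vec():
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


# Random stand-ins for the projected query, content keys, and content values.
query = rand_vec()
content_keys = [rand_vec() for _ in range(n_content)]
content_values = [rand_vec() for _ in range(n_content)]

# Hypothetical learned prefix: one extra key/value pair placed before the content.
prefix_key, prefix_value = rand_vec(), rand_vec()

base_weights, base_output = attend(query, content_keys, content_values)
full_weights, full_output = attend(
    query, [prefix_key] + content_keys, [prefix_value] + content_values
)

# Attention mass captured by the prefix position.
alpha = full_weights[0]

# Renormalise the content part: the relative pattern over content tokens.
relative = [w / (1.0 - alpha) for w in full_weights[1:]]

# Output predicted by "scaled original output plus a bias along prefix_value".
predicted = [(1.0 - alpha) * o + alpha * p for o, p in zip(base_output, prefix_value)]

pattern_gap = max(abs(r - b) for r, b in zip(relative, base_weights))
output_gap = max(abs(f - p) for f, p in zip(full_output, predicted))

print("max change in relative content attention:", pattern_gap)
print("max deviation from the scaled-plus-bias form:", output_gap)
```

Both printed gaps should be at floating-point noise level. The sketch covers one head in one layer only; it says nothing about how the effect composes across layers or about the empirical results in the paper.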

Publication status:: Published

Peer review status:: Peer reviewed

Cite this record

APA Style

Petrov, A., Torr, P., & Bibi, A. (2024). When do prompting and prefix-tuning work? A theory of capabilities and limitations. 12th International Conference on Learning Representations (ICLR 2024).

MLA Style

Petrov, A, et al. “When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations.” 12th International Conference on Learning Representations (ICLR 2024), 2024.

Chicago Style

Petrov, A, P Torr, and A Bibi. 2024. “When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations.” In 12th International Conference on Learning Representations (ICLR 2024). OpenReview.

Access Document

Files:: Petrov_et_al_2024_When_do_prompting.pdf

(Version of record, pdf, 1.3MB)

Publication website:: https://openreview.net/forum?id=JewzobRhay

Authors

Petrov, A

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Torr, P

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author
ORCID:: 0009-0006-0259-5732

Bibi, A

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Engineering and Physical Sciences Research Council

Grant:: EP/W002981/1

Publisher:: OpenReview
Host title:: Proceedings of the 12th International Conference on Learning Representations (ICLR 2024)
Publication date:: 2024-01-16
Acceptance date:: 2024-01-15
Event title:: 12th International Conference on Learning Representations (ICLR 2024)
Event location:: Vienna, Austria
Event website:: https://iclr.cc/Conferences/2024
Event start date:: 2024-05-07
Event end date:: 2024-05-11

Language:: English
Keywords:: prompt, theory, prefix, LLM, fine-tuning
Pubs id:: 1838323
Local pid:: pubs:1838323
Deposit date:: 2024-03-18
ARK identifier:: ark:/29072/ora_b678f868391c4a50b8eac6d377b93659

Terms of use

Notes:: This paper was presented at the International Conference on Learning Representations (ICLR 2024), 7th - 11th May 2024, Vienna, Austria.

Licence:: Terms and Conditions of Use for Oxford University Research Archive
