AI Collection

Conference item

It's a TRAP! Task-redirecting agent persuasion benchmark for web agents

Abstract:: Web-based agents powered by large language models are increasingly used for tasks such as email management or professional networking. Their reliance on dynamic web content, however, makes them vulnerable to prompt injection attacks: adversarial instructions hidden in interface elements that persuade the agent to divert from its original task. We introduce the Task-Redirecting Agent Persuasion Benchmark (TRAP), a benchmark for studying how persuasion techniques misguide autonomous web agents on realistic tasks. Across six frontier models, agents are susceptible to prompt injection in 25% of tasks on average (13% for GPT-5 to 43% for DeepSeek-R1), with small interface or contextual changes often doubling success rates and revealing systemic, psychologically driven vulnerabilities in web-based agents. We also provide a modular social-engineering injection framework with controlled experiments on high-fidelity website clones, allowing for further benchmark expansion.

Publication status:: Accepted

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Korgul, K., Yang, Y., Drohomireck, A., Błaszczyk, P., Howard, W., Aichberger, L., Russel, C., Torr, P. H. S., Mahdi, A., & Bibi, A. (2026). It's a TRAP! Task-redirecting agent persuasion benchmark for web agents. International Conference on Machine Learning (ICML 2026).

MLA Style

Korgul, K, et al. “It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents.” International Conference on Machine Learning (ICML 2026), 2026.

Chicago Style

Korgul, K, Y Yang, A Drohomireck, P Błaszczyk, W Howard, L Aichberger, C Russel, PHS Torr, A Mahdi, and A Bibi. 2026. “It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents.” In International Conference on Machine Learning (ICML 2026).
Print

Authors

+ Korgul, K More by this author

Institution:: University of Oxford
Division:: SSD
Department:: Oxford Internet Institute
Role:: Author

+ Yang, Y More by this author

Institution:: University of Oxford
Division:: SSD
Department:: Oxford Internet Institute
Role:: Author

+ Drohomireck, A More by this author

Role:: Author

+ Błaszczyk, P More by this author

Role:: Author

+ Howard, W More by this author

Role:: Author

More authors...

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: https://ror.org/0439y7842
Grant:: EP/W002981/1

+ Clarendon Fund Scholarship, University of Oxford More from this funder

Funder identifier:: https://ror.org/052gg0110

Host title:: Proceedings of Machine Learning Research (PMLR) 2026
Acceptance date:: 2026-05-26
Event title:: International Conference on Machine Learning (ICML 2026)
Event location:: Seoul, South Korea
Event website:: https://icml.cc/
Event start date:: 2026-07-06
Event end date:: 2026-07-11

Language:: English
Pubs id:: 2434023
Local pid:: pubs:2434023
Deposit date:: 2026-06-16
ARK identifier:: ark:/29072/ora_945b2a6c2eab43f48ea027296262ebfd

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP