Conference item
MIP against agent: Malicious Image Patches hijacking multimodal OS agents
- Abstract:
- Recent advances in operating system (OS) agents have enabled vision-language models (VLMs) to directly control a user’s computer. Unlike conventional VLMs that passively output text, OS agents autonomously perform computer-based tasks in response to a single user prompt. OS agents do so by capturing, parsing, and analysing screenshots and executing low-level actions via application programming interfaces (APIs), such as mouse clicks and keyboard inputs. This direct interaction with the OS significantly raises the stakes, as failures or manipulations can have immediate and tangible consequences. In this work, we uncover a novel attack vector against these OS agents: Malicious Image Patches (MIPs), adversarially perturbed screen regions that, when captured by an OS agent, induce it to perform harmful actions by exploiting specific APIs. For instance, a MIP can be embedded in a desktop wallpaper or shared on social media to cause an OS agent to exfiltrate sensitive user data. We show that MIPs generalise across user prompts and screen configurations, and that they can hijack multiple OS agents even during the execution of benign instructions. These findings expose critical security vulnerabilities in OSagents that have to be carefully addressed before their widespread deployment.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 10.2MB, Terms of use)
-
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/W002981/1
- Publisher:
- NeurIPS
- Host title:
- Advances in Neural Information Processing Systems 38
- Pages:
- 18536-18575
- Publication date:
- 2026-02-02
- Acceptance date:
- 2025-09-19
- Event title:
- 39th Annual Conference on Neural Information Processing Systems (NeurIPS 2025)
- Event location:
- San Diego, CA, USA
- Event website:
- https://neurips.cc/Conferences/2025
- Event start date:
- 2025-12-02
- Event end date:
- 2025-12-07
- Language:
-
English
- Pubs id:
-
2433729
- Local pid:
-
pubs:2433729
- Deposit date:
-
2026-06-15
- ARK identifier:
Terms of use
- Copyright holder:
- Aichberger et al and Neural Information Processing Systems Foundation Inc.
- Copyright date:
- 2023
- Rights statement:
- © (2026) by individual authors and Neural Information Processing Systems Foundation Inc. All rights reserved.
- Notes:
- The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record