Can an individual manipulate the collective decisions of multi-agents?

Liu, F; Zhao, R; Chen, S; Li, G; Torr, P; Han, L; Gu, J

AI Collection

Conference item

Can an individual manipulate the collective decisions of multi-agents?

Abstract:: Individual Large Language Models (LLMs) have demonstrated significant capabilities across various domains, such as healthcare and law. Recent studies also show that coordinated multi-agent systems exhibit enhanced decision-making and reasoning abilities through collaboration. However, due to the vulnerabilities of individual LLMs and the difficulty of accessing all agents in a multi-agent system, a key question arises: If attackers only know one agent, could they still generate adversarial samples capable of misleading the collective decision?To explore this question, we formulate it as a game with incomplete information, where attackers know only one target agent and lack knowledge of the other agents in the system. With this formulation, we propose M-Spoiler, a framework that simulates agent interactions within a multi-agent system to generate adversarial samples. These samples are then used to manipulate the target agent in the target system, misleading the system’s collaborative decision-making process.More specifically, M-Spoiler introduces a stubborn agent that actively aids in optimizing adversarial samples by simulating potential stubborn responses from agents in the target system. This enhances the effectiveness of the generated adversarial samples in misleading the system.Through extensive experiments across various tasks, our findings confirm the risks posed by the knowledge of an individual agent in multi-agent systems and demonstrate the effectiveness of our framework.We also explore several defense mechanisms, showing that our proposed attack framework remains more potent than baselines, underscoring the need for further research into defensive strategies.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Liu, F., Zhao, R., Chen, S., Li, G., Torr, P., Han, L., & Gu, J. (2025). Can an individual manipulate the collective decisions of multi-agents? Empirical Methods in Natural Language Processing (EMNLP 2025), 12169–12193.

MLA Style

Liu, F, et al. “Can an Individual Manipulate the Collective Decisions of Multi-Agents?” Empirical Methods in Natural Language Processing (EMNLP 2025), 2025, pp. 12169–93.

Chicago Style

Liu, F, R Zhao, S Chen, G Li, P Torr, L Han, and J Gu. 2025. “Can an Individual Manipulate the Collective Decisions of Multi-Agents?” In Empirical Methods in Natural Language Processing (EMNLP 2025), 12169–93. Association for Computational Linguistics.
Print

Access Document

Files:: Liu_et_al_2025_Can_an_individual.pdf

(Preview, Version of record, pdf, 1.8MB, Terms of use)

Publisher copy:: 10.18653/v1/2025.emnlp-main.611

Authors

+ Liu, F More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Zhao, R More by this author

Role:: Author

+ Chen, S More by this author

Role:: Author

+ Li, G More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Torr, P More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author
ORCID:: 0009-0006-0259-5732

More authors...

Publisher:: Association for Computational Linguistics
Host title:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Pages:: 12169-12193
Publication date:: 2025-11-04
Acceptance date:: 2025-08-21
Event title:: Empirical Methods in Natural Language Processing (EMNLP 2025)
Event location:: Suzhou, China
Event website:: https://2025.emnlp.org/
Event start date:: 2025-11-04
Event end date:: 2025-11-09
DOI:: 10.18653/v1/2025.emnlp-main.611
ISBN:: 9798891763326

Language:: English
Pubs id:: 2328662
Local pid:: pubs:2328662
Deposit date:: 2025-11-17
ARK identifier:: ark:/29072/ora_a93368bbb7b14661803e2ce24f74119a

Terms of use

Copyright holder:: Liu et al.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

Can an individual manipulate the collective decisions of multi-agents?

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

Can an individual manipulate the collective decisions of multi-agents?

Actions

Access Document

Authors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions