CoMAS: co-evolving multi-agent systems via interaction rewards

Xue, X; Zhou, Y; Zhang, G; Zhang, Z; Li, Y; Zhang, C; Yin, Z; Torr, P; Ouyang, W; Bai, L

AI Collection

Conference item

CoMAS: co-evolving multi-agent systems via interaction rewards

Abstract:: Self-evolution is a central research topic in enabling large language model (LLM)- based agents to continually improve their capabilities after pretraining. Recent research has witnessed a transition from reinforcement learning (RL)-free to RLbased methods. Current RL-based methods either rely on dense external reward signals or extract intrinsic reward signals from LLMs themselves. However, these approaches diverge from the self-evolution mechanisms observed in human intelligence, where individuals learn and improve through mutual discussion and collaboration. In this work, we introduce Co-Evolving Multi-Agent Systems (CoMAS), a novel framework that enables agents to improve autonomously by learning from inter-agent interactions without external supervision. CoMAS generates intrinsic rewards from rich discussion dynamics, employs an LLM-as-a-judge mechanism to formulate these rewards, and optimizes each agent’s policy through RL, thereby enabling decentralized and scalable co-evolution. Experimental results demonstrate that CoMAS consistently outperforms untrained agents and achieves stateof-the-art performance across most evaluation settings. Ablation studies confirm the necessity of interaction-based reward signals and reveal promising scalability as the number and diversity of agents increase. These findings establish CoMAS as a novel and effective paradigm for self-evolution in LLM-based agents. Our code is available at: https://github.com/xxyQwQ/CoMAS.

Publication status:: Accepted

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Xue, X., Zhou, Y., Zhang, G., Zhang, Z., Li, Y., Zhang, C., Yin, Z., Torr, P., Ouyang, W., & Bai, L. (2026). CoMAS: co-evolving multi-agent systems via interaction rewards. 14th International Conference on Learning Representations (ICLR 2026).

MLA Style

Xue, X, et al. “CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards.” 14th International Conference on Learning Representations (ICLR 2026), 2026.

Chicago Style

Xue, X, Y Zhou, G Zhang, Z Zhang, Y Li, C Zhang, Z Yin, P Torr, W Ouyang, and L Bai. 2026. “CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards.” In 14th International Conference on Learning Representations (ICLR 2026). OpenReview.
Print

Access Document

Files:: Xue_et_al_2026_CoMAS_co-evolving_multi-agent.pdf

(Preview, Accepted manuscript, pdf, 1.7MB, Terms of use)

Publication website:: https://openreview.net/forum?id=ihwAzktmWc

Authors

+ Xue, X More by this author

Role:: Author

+ Zhou, Y More by this author

Role:: Author

+ Zhang, G More by this author

Role:: Author

+ Zhang, Z More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Li, Y More by this author

Role:: Author

More authors...

Publisher:: OpenReview
Host title:: Proceedings of the 14th International Conference on Learning Representations (ICLR 2026)
Article number:: 5758
Acceptance date:: 2026-01-26
Event title:: 14th International Conference on Learning Representations (ICLR 2026)
Event location:: Rio de Janeiro, Brazil
Event website:: https://openreview.net/pdf?id=ihwAzktmWc
Event start date:: 2026-04-23
Event end date:: 2026-04-27

Language:: English
Pubs id:: 2433615
Local pid:: pubs:2433615
Deposit date:: 2026-06-15
ARK identifier:: ark:/29072/ora_40cd777413894f859d18ce84de2199b0

Terms of use

Copyright holder:: Xue et al.
Notes:: The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

CoMAS: co-evolving multi-agent systems via interaction rewards

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

CoMAS: co-evolving multi-agent systems via interaction rewards

Actions

Access Document

Authors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions