Journal article icon

Journal article

Continuously evolving rewards in an open-ended environment

Abstract:
Unambiguous identification of the rewards driving behaviours of entities operating in complex open-ended real-world environments is difficult, in part because goals and associated behaviours emerge endogenously and are dynamically updated as environments change. Reproducing such dynamics in models would be useful in many domains, particularly where fixed reward functions limit the adaptive capabilities of agents. Simulation experiments described here assess a candidate algorithm for the dynamic updating of the reward function, RULE: Reward Updating through Learning and Expectation. The approach is tested in a simplified ecosystem-like setting where experiments challenge entities' survival, calling for significant behavioural change. The population of entities successfully demonstrate the abandonment of an initially rewarded but ultimately detrimental behaviour, amplification of beneficial behaviour, and appropriate responses to novel items added to their environment. These adjustments happen through endogenous modification of the entities' reward function, during continuous learning, without external intervention.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publication website:
https://www.jmlr.org/papers/v26/24-0847.html

Authors

More by this author
Institution:
University of Oxford
Division:
SSD
Department:
SOGE
Sub department:
Environmental Change Institute
Role:
Author


Publisher:
Journal of Machine Learning Research
Journal:
Journal of Machine Learning Research More from this journal
Volume:
26
Issue:
62
Pages:
1-51
Publication date:
2025-02-17
Acceptance date:
2025-02-17
EISSN:
1533-7928
ISSN:
1532-4435


Language:
English
Keywords:
Pubs id:
2091301
Local pid:
pubs:2091301
Deposit date:
2025-02-23
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP