Journal article icon

Journal article

On the Convergence of Reinforcement Learning.

Abstract:
This paper examines the convergence of payoffs and strategies in Erev and Roth's model of reinforcement learning. When all players use this rule it eliminates iteratively dominated strategies and in two-person constant-sum games average payoffs converge to the value of the game. Strategies converge in constant-sum games with unique equilibria if they are pure or if they are mixed and the game is 2 x 2. The long-run behaviour of the learning rule is governed by equations related to Maynard Smith's version of the replicator dynamic. Properties of the learning rule against general opponents are also studied.

Actions


Access Document


Files:
Publisher copy:
10.1016/j.jet.2004.03.008

Authors



Publisher:
Elsevier
Journal:
Journal of Economic Theory More from this journal
Volume:
122
Issue:
1
Pages:
1 - 36
Publication date:
2005-01-01
DOI:
ISSN:
0022-0531


UUID:
uuid:97339c58-0d4c-40ca-a289-ce0e96fa04d6
Local pid:
oai:economics.ouls.ox.ac.uk:14191
Deposit date:
2011-08-15

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP