Working paper

On the Convergence of Reinforcement Learning.

Abstract:: This paper examines the convergence of payoffs and strategies in Erev and Roth's model of reinforcement learning. When all players use this rule it eliminates iteratively dominated strategies and in two-person constant-sum games average payoffs converge to the value of the game. Strategies converge in constant-sum games with unique equilibria if they are pure or if they are mixed and the game is 2 x 2. The long-run behaviour of the learning rule is governed by equations related to Maynard Smith's version of the replicator dynamic. Properties of the learning rule against general opponents are also studied.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Cite

Cite this record

APA Style

Beggs, A. (2002). On the Convergence of Reinforcement Learning. Department of Economics (University of Oxford).

MLA Style

Beggs, A. On the Convergence of Reinforcement Learning. Department of Economics (University of Oxford), 2002.

Chicago Style

Beggs, A. 2002. “On the Convergence of Reinforcement Learning.” Discussion Papers. Department of Economics (University of Oxford).
Share
Print

Access Document

Files:: paper096.pdf

(Preview, pdf, 526.5KB, Terms of use)

Authors

+ Beggs, A More by this author

Role:: Author

Publisher:: Department of Economics (University of Oxford)
Series:: Discussion Papers
Publication date:: 2002-01-01

Language:: English
UUID:: uuid:c890fa3f-de61-4ee6-96d7-e265acade350
Local pid:: ora:1133
Deposit date:: 2011-08-15

Terms of use

Copyright date:: 2002

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP