Composing the value signal for dopamine-mediated learning

Mahajan, P; Seymour, B

Abstract:: The seminal reward prediction error theory of dopamine function faces several key challenges. Most notable are the difficulty of learning multiple rewards simultaneously, the inefficiency of on-policy learning, and the need to account for heterogeneous responses in the tail of the striatum. We propose a normative framework, based on linear reinforcement learning, that redefines dopamine’s computational objective. Under this framework, dopamine optimises not just cumulative reward but a reward value function augmented by a penalty for deviating from a default behavioural policy, which effectively confers value on controllability. Our simulations show that this single modification enables optimal value composition, fast and robust adaptation to changing priorities, safer exploration in the context of threats, and stable learning amid uncertainty. Critically, it unifies disparate striatal observations, parsimoniously reconciling threat and action prediction error signals within the striatal tail. Our framework refines the core principle governing striatal dopamine, bridging theory with neural data and offering testable predictions.
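The idea of a value function penalised for deviating from a default policy can be illustrated with a small sketch. This record does not give the paper's equations, so the code below assumes the textbook linear reinforcement learning formulation, in which the penalty is a KL divergence from the default policy and the exponentiated value z = exp(v / lam) satisfies a linear equation. The 5-state chain, the reward values, and every function name are hypothetical choices made for illustration, not the authors' model or simulations.

```python
import math

def solve_desirability(terminal_z, step_reward=-0.1, lam=1.0, n_iter=500):
    """Fixed-point iteration for z = exp(r / lam) * (P_default @ z).

    Hypothetical 5-state chain: states 0 and 4 are absorbing goals whose
    desirability exp(reward / lam) is given by terminal_z; states 1-3 pay
    step_reward and, under the assumed default policy, step left or right
    with probability 0.5 each.
    """
    z = [terminal_z[0], 1.0, 1.0, 1.0, terminal_z[1]]
    gain = math.exp(step_reward / lam)
    for _ in range(n_iter):
        # Synchronous update of the interior states; goals stay fixed.
        z = [z[0]] + [gain * 0.5 * (z[s - 1] + z[s + 1]) for s in (1, 2, 3)] + [z[4]]
    return z

def values(z, lam=1.0):
    """Convert desirability back to value: v = lam * log(z)."""
    return [lam * math.log(x) for x in z]

def optimal_step_probs(z, s):
    """Optimal policy at interior state s: default policy reweighted by z."""
    left, right = 0.5 * z[s - 1], 0.5 * z[s + 1]
    return left / (left + right), right / (left + right)

# Two hypothetical tasks, specified by the rewards at the two goals (lam = 1).
z_a = solve_desirability((math.exp(1.0), math.exp(0.0)))  # reward 1 on the left
z_b = solve_desirability((math.exp(0.0), math.exp(2.0)))  # reward 2 on the right

# Composite task whose goal desirability is an equal mix of the two.
w = 0.5
composed = [w * a + (1 - w) * b for a, b in zip(z_a, z_b)]
direct = solve_desirability((w * z_a[0] + (1 - w) * z_b[0],
                             w * z_a[4] + (1 - w) * z_b[4]))

max_gap = max(abs(c - d) for c, d in zip(composed, direct))
print("largest gap between composed and directly solved z:", max_gap)
print("composite values:", values(composed))
print("P(left), P(right) at the middle state:", optimal_step_probs(composed, 2))
```

Because the equation is linear in z, the solution for the composite task is the same weighted mix of the two component solutions, so it can be read off without solving again. Note that under this assumption composition holds in the exponentiated space: the composite goal reward is lam times the log of the mixed desirabilities, not the sum of the two rewards.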

Publication status:: Published

Peer review status:: Not peer reviewed

Cite this record

APA Style

Mahajan, P., & Seymour, B. (2025). Composing the value signal for dopamine-mediated learning. bioRxiv.

MLA Style

Mahajan, P, and B Seymour. Composing the Value Signal for Dopamine-Mediated Learning. bioRxiv, 2025.

Chicago Style

Mahajan, P, and B Seymour. 2025. Composing the Value Signal for Dopamine-Mediated Learning. bioRxiv.

Access Document

Files:: Mahajan_and_Seymour_2025_Composing_the_value.pdf

(Pre-print, pdf, 16.0MB)

Preprint server copy:: 10.1101/2025.10.10.681616

Authors

Mahajan, P

Institution:: University of Oxford
Division:: MSD
Department:: Clinical Neurosciences
Role:: Author
ORCID:: 0009-0001-2507-5450

Seymour, B

Institution:: University of Oxford
Division:: MSD
Department:: Clinical Neurosciences
Role:: Author
ORCID:: 0000-0003-1724-5832

Funding

Wellcome Trust

Funder identifier:: https://ror.org/029chgv08
Grant:: 203139/A/16/Z; 214251/Z/18/Z; 203139/Z/16/Z

Japan Society for the Promotion of Science

Funder identifier:: https://ror.org/00hhkn466
Grant:: 22H04998

Institute of Information & Communications Technology Planning & Evaluation

Funder identifier:: https://ror.org/01g0hqq23
Grant:: MSIT 2019-0-01371

National Institute for Health and Care Research

Grant:: NIHR203316

Preprint server:: bioRxiv
Publication date:: 2025-11-22
DOI:: 10.1101/2025.10.10.681616

Language:: English
Pubs id:: 2303799
UUID:: uuid_fe2b180f-a8c6-4289-b5a5-e6e98a059568
Local pid:: pubs:2303799
Source identifiers:: W4415055624
Deposit date:: 2026-01-28
ARK identifier:: ark:/29072/ora_fe2b180fa8c64289b5a5e6e98a059568

Terms of use

Copyright holder:: Mahajan and Seymour
Rights statement:: The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Notes:: This work is related to the thesis "Safe learning in humans and machines".

Licence:: CC Attribution (CC BY)
