Preprint

On the necessity of adaptive regularisation: optimal anytime online learning on ℓp-balls

Abstract:: We study online convex optimization on $\ell_p$-balls in $\mathbb{R}^d$ for $p > 2$. While always sub-linear, the optimal regret exhibits a shift between the high-dimensional setting ($d > T$), when the dimension $d$ is greater than the time horizon $T$ and the low-dimensional setting ($d \leq T$). We show that Follow-the-Regularised-Leader (FTRL) with time-varying regularisation which is adaptive to the dimension regime is anytime optimal for all dimension regimes. Motivated by this, we ask whether it is possible to obtain anytime optimality of FTRL with fixed non-adaptive regularisation. Our main result establishes that for separable regularisers, adaptivity in the regulariser is necessary, and that any fixed regulariser will be sub-optimal in one of the two dimension regimes. Finally, we provide lower bounds which rule out sub-linear regret bounds for the linear bandit problem in sufficiently high-dimension for all $\ell_p$-balls with $p \geq 1$.

Publication status:: Published

Peer review status:: Not peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Johnson, E., Martinez-Rubio, D., Pike-Burke, C., & Rebeschini, P. (2025). On the necessity of adaptive regularisation: optimal anytime online learning on ℓp-balls. arXiv.

MLA Style

Johnson, E, et al. On the Necessity of Adaptive Regularisation: Optimal Anytime Online Learning on ℓp-Balls. arXiv, 2025.

Chicago Style

Johnson, E, D Martinez-Rubio, C Pike-Burke, and P Rebeschini. 2025. On the Necessity of Adaptive Regularisation: Optimal Anytime Online Learning on ℓp-Balls. ArXiv.
Print

Access Document

Files:: Johnson_et_al_2025_On_the_necessity.pdf

(Preview, Pre-print, pdf, 764.5KB, Terms of use)

Preprint server copy:: 10.48550/arxiv.2506.19752

Authors

+ Johnson, E More by this author

Role:: Author

+ Martinez-Rubio, D More by this author

Role:: Author

+ Pike-Burke, C More by this author

Role:: Author

+ Rebeschini, P More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author
ORCID:: 0000-0001-7772-4160

Preprint server:: arXiv
Publication date:: 2025-06-24
DOI:: 10.48550/arxiv.2506.19752
EISSN:: 2331-8422

Language:: English
Pubs id:: 2244164
UUID:: uuid_a0873b3f-3ea9-4b2f-8636-53ad6d9783eb
Local pid:: pubs:2244164
Source identifiers:: W4414684661
Deposit date:: 2026-01-05
ARK identifier:: ark:/29072/ora_a0873b3f3ea94b2f863653ad6d9783eb

Terms of use

Copyright holder:: Johnson et al
Copyright date:: 2025
Rights statement:: ©2025 The Authors. This paper is an open access article distributed under the terms of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/)

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP