Non-stationary bandit convex optimization: a comprehensive study

Liu, X; Baudry, D; Zimmert, J; Rebeschini, P; Akhavan, A

AI Collection

Preprint

Abstract:: Bandit Convex Optimization is a fundamental class of sequential decision-making problems, where the learner selects actions from a continuous domain and observes a loss (but not its gradient) at only one point per round. We study this problem in non-stationary environments, and aim to minimize the regret under three standard measures of non-stationarity: the number of switches $S$ in the comparator sequence, the total variation $\Delta$ of the loss functions, and the path-length $P$ of the comparator sequence. We propose a polynomial-time algorithm, Tilted Exponentially Weighted Average with Sleeping Experts (TEWA-SE), which adapts the sleeping experts framework from online convex optimization to the bandit setting. For strongly convex losses, we prove that TEWA-SE is minimax-optimal with respect to known $S$ and $\Delta$ by establishing matching upper and lower bounds. By equipping TEWA-SE with the Bandit-over-Bandit framework, we extend our analysis to environments with unknown non-stationarity measures. For general convex losses, we introduce a second algorithm, clipped Exploration by Optimization (cExO), based on exponential weights over a discretized action space. While not polynomial-time computable, this method achieves minimax-optimal regret with respect to known $S$ and $\Delta$, and improves on the best existing bounds with respect to $P$.
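The three non-stationarity measures named in the abstract can be illustrated on a toy instance. The sketch below is not taken from the paper: the helper names (`count_switches`, `path_length`, `total_variation`, `quadratic_loss`), the quadratic losses, and the grid approximation of the supremum are all assumptions made for illustration, and the paper's exact conventions (for instance whether $S$ counts changes or segments, or which norm defines $P$) may differ.

```python
import math

def count_switches(comparators):
    # S: number of rounds at which the comparator changes (assumed convention).
    return sum(1 for prev, cur in zip(comparators, comparators[1:]) if prev != cur)

def path_length(comparators):
    # P: sum of Euclidean distances between consecutive comparators (assumed norm).
    return sum(math.dist(prev, cur) for prev, cur in zip(comparators, comparators[1:]))

def total_variation(losses, grid):
    # Delta: sum over rounds of max |f_t(x) - f_{t+1}(x)|.
    # The maximum is taken over a finite grid, so this only approximates
    # (from below) the supremum over the whole domain.
    return sum(
        max(abs(f(x) - g(x)) for x in grid)
        for f, g in zip(losses, losses[1:])
    )

def quadratic_loss(center):
    # Hypothetical strongly convex loss f(x) = ||x - center||^2.
    def loss(x):
        return sum((xi - ci) ** 2 for xi, ci in zip(x, center))
    return loss

# Toy instance on [0, 1]^2: the minimizer moves twice over five rounds.
comparators = [(0.0, 0.0), (0.0, 0.0), (1.0, 0.0), (1.0, 0.0), (1.0, 1.0)]
losses = [quadratic_loss(u) for u in comparators]
grid = [(i / 10, j / 10) for i in range(11) for j in range(11)]

S = count_switches(comparators)
P = path_length(comparators)
Delta = total_variation(losses, grid)
print(S, P, Delta)
```

In this toy instance the comparators are the per-round minimizers, so all three quantities grow together; in general they can differ substantially, which is why the abstract treats them as separate measures.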

Publication status:: Published

Peer review status:: Not peer reviewed

Cite this record

APA Style

Liu, X., Baudry, D., Zimmert, J., Rebeschini, P., & Akhavan, A. (2025). Non-stationary bandit convex optimization: a comprehensive study. arXiv.

MLA Style

Liu, X, et al. Non-Stationary Bandit Convex Optimization: A Comprehensive Study. arXiv, 2025.

Chicago Style

Liu, X, D Baudry, J Zimmert, P Rebeschini, and A Akhavan. 2025. Non-Stationary Bandit Convex Optimization: A Comprehensive Study. arXiv.

Access Document

Files:: Liu_et_al_2025_Non-stationary_bandit_convex.pdf

(Pre-print, pdf, 700.5KB)

Preprint server copy:: 10.48550/arXiv.2506.02980

Authors

Liu, X

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author

Baudry, D

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author

Zimmert, J

Role:: Author

Rebeschini, P

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author
ORCID:: 0000-0001-7772-4160

Akhavan, A

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author

Preprint server:: arXiv
Publication date:: 2025-06-03
DOI:: 10.48550/arXiv.2506.02980
EISSN:: 2331-8422

Language:: English
Pubs id:: 2128784
Local pid:: pubs:2128784
Deposit date:: 2026-01-05
ARK identifier:: ark:/29072/ora_07fb0feffd754a89813619afedd30952

Terms of use

Copyright holder:: Liu et al
Rights statement:: ©2025 The Authors. This paper is an open access article distributed under the terms of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/)

Licence:: CC Attribution (CC BY)
