Preprint
Stochastic shortest path with sparse adversarial costs
- Abstract:
- We study the adversarial Stochastic Shortest Path (SSP) problem with sparse costs under full-information feedback. In the known transition setting, existing bounds based on Online Mirror Descent (OMD) with negative-entropy regularization scale with $\sqrt{\log(SA)}$, where $SA$ is the size of the state-action space. While we show that this is optimal in the worst-case, this bound fails to capture the benefits of sparsity when only a small number $M \ll SA$ of state-action pairs incur cost. In fact, we also show that the negative-entropy is inherently non-adaptive to sparsity: it provably incurs regret scaling with $\sqrt{\log(S)}$ on sparse problems. Instead, we propose a family of $\ell_{r}$-norm regularizers ($r \in (1,2)$) that adapts to the sparsity and achieves regret scaling with $\sqrt{\log(M)}$ instead of $\sqrt{\log(SA)}$. We show this is optimal via a matching lower bound, highlighting that captures the effective dimension of the problem instead of $SA$. Finally, in the unknown transition setting the benefits of sparsity are limited: we prove that even on sparse problems, the minimax regret for any learner scales polynomially with $SA$.
- Publication status:
- Published
- Peer review status:
- Not peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Pre-print, pdf, 583.6KB, Terms of use)
-
- Preprint server copy:
- 10.48550/arXiv.2511.00637
Authors
- Preprint server:
- arXiv
- Publication date:
- 2025-11-01
- DOI:
- EISSN:
-
2331-8422
- Language:
-
English
- Pubs id:
-
2338612
- Local pid:
-
pubs:2338612
- Deposit date:
-
2026-01-05
- ARK identifier:
Terms of use
- Copyright holder:
- Johnson et al
- Copyright date:
- 2025
- Rights statement:
- ©2025 The Authors. This paper is an open access article distributed under the terms of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/)
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record