Published October 2, 2024 | Version v1
Conference paper (Open Access)

Sparsity-agnostic linear bandits with adaptive adversaries.

  • 1. School of Computing, National University of Singapore
  • 2. University of Milan
  • 3. Politecnico di Milano

Description

We study stochastic linear bandits where, in each round, the learner receives a set of actions (i.e., feature vectors), from which it chooses an element and obtains a stochastic reward. The expected reward is a fixed but unknown linear function of the chosen action. We study sparse regret bounds that depend on the number S of non-zero coefficients in the linear reward function. Previous works focused on the case where S is known, or on action sets satisfying additional assumptions. In this work, we obtain the first sparse regret bounds that hold when S is unknown and the action sets are adversarially generated. Our techniques combine online-to-confidence-set conversions with a novel randomized model selection approach over a hierarchy of nested confidence sets. When S is known, our analysis recovers state-of-the-art bounds for adversarial action sets. We also show that a variant of our approach, using Exp3 to dynamically select the confidence sets, can be used to improve the empirical performance of stochastic linear bandits while enjoying a regret bound whose dependence on the time horizon is optimal.
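The abstract mentions using Exp3 to select dynamically among a hierarchy of confidence sets (one per candidate sparsity level). The sketch below is not the paper's algorithm; it is a minimal, generic Exp3 routine over K arms, where each arm is imagined to index one confidence-set level. The reward means in the demo are made up for illustration.

```python
import numpy as np

def exp3_step(rng, weights, reward_fn, gamma):
    """One round of Exp3: sample an arm, observe its reward in [0, 1],
    apply an importance-weighted multiplicative update."""
    K = len(weights)
    # Mix the weight distribution with uniform exploration of rate gamma.
    probs = (1.0 - gamma) * weights / weights.sum() + gamma / K
    arm = rng.choice(K, p=probs)
    reward = float(reward_fn(arm))
    # Dividing by the sampling probability keeps the estimate unbiased.
    est = reward / probs[arm]
    weights = weights.copy()
    weights[arm] *= np.exp(gamma * est / K)
    return weights / weights.max(), arm  # renormalize for numerical stability

def run_demo(T=2000, gamma=0.1, seed=0):
    """Hypothetical demo: 4 arms stand for 4 nested confidence-set levels;
    arm 2 plays the role of the 'right' sparsity level (higher mean reward)."""
    rng = np.random.default_rng(seed)
    means = np.array([0.2, 0.2, 0.9, 0.2])  # illustrative Bernoulli means
    weights = np.ones(len(means))
    picks = np.zeros(len(means), dtype=int)
    for _ in range(T):
        weights, arm = exp3_step(rng, weights,
                                 lambda a: rng.random() < means[a], gamma)
        picks[arm] += 1
    return weights, picks
```

Run `run_demo()` and the pick counts concentrate on arm 2, while the uniform-exploration term `gamma / K` guarantees every level keeps being sampled at a small rate, which is what lets such a master algorithm recover if a base learner is misspecified.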

Files

2406.01192v1.pdf (780.4 kB, md5:f57347eea8159116723481ebcf54becf)

Additional details

Funding

European Commission
ELIAS – European Lighthouse of AI for Sustainability (grant agreement 101120237)