Sum-max submodular bandits

Pasteris, Stephen; Rumi, Alberto; Vitale, Fabio; Cesa-Bianchi, Nicolò

doi:10.5281/zenodo.13881304

Published October 2, 2024 | Version v1

Conference paper Open

Sum-max submodular bandits

1. The Alan Turing Institute
2. University of Milan
3. CENTAI
4. Politecnico di Milano

Many online decision-making problems correspond to maximizing a sequence of submodular functions. In this work, we introduce sum-max functions, a subclass of monotone submodular functions capturing several interesting problems, including best-of-K-bandits, combinatorial bandits, and the bandit versions on M-medians and hitting sets. We show that all functions in this class satisfy a key property that we call pseudo-concavity. This allows us to prove (1-1/e)-regret bounds for bandit feedback in the nonstochastic setting of the order of sqrt(MKT) (ignoring log factors), where T is the time horizon and M is a cardinality constraint. This bound, attained by a simple and efficient algorithm, significantly improves on the O(T^2/3) regret bound for online monotone submodular maximization with bandit feedback. We also extend our results to a bandit version of the facility location problem.

Files

pasteris24a.pdf

Files (20.1 MB)

Name	Size	Download all
pasteris24a.pdf md5:34983ac82e9f47fda607021f69da7486	20.1 MB	Preview Download

Additional details

European Commission
ELIAS - European Lighthouse of AI for Sustainability 101120237

Views

Downloads

Show more details

	All versions	This version
Views	41	41
Downloads	39	39
Data volume	805.9 MB	805.9 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Conference

Neural Information Processing Systems (NEURIPS), December 2024

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: October 8, 2024
Modified: October 8, 2024

Sum-max submodular bandits

Creators

Description

Files

pasteris24a.pdf

Files (20.1 MB)

Additional details

Funding