Published August 9, 2024 | Version v1
Journal Open

Cooperative Online Learning with Feedback Graphs

  • 1. ROR icon University of Milan
  • 2. ROR icon Politecnico di Milano
  • 3. ROR icon Ottawa University
  • 4. ROR icon Université de Lille
  • 5. ROR icon Institut national de recherche en informatique et en automatique

Description

We study the interplay between communication and feedback in a cooperative online learning setting, where a network of communicating agents learn a common sequential decision-making task through a feedback graph. We bound the network regret in terms of the independence number of the strong product between the communication network and the feedback graph. Our analysis recovers as special cases many previously known bounds for cooperative online learning with expert or bandit feedback. We also prove an instance-based lower bound, demonstrating that our positive results are not improvable except in pathological cases. Experiments on synthetic data confirm our theoretical findings.

Files

2106.04982v5.pdf

Files (1.4 MB)

Name Size Download all
md5:b569a556dd6d72e960e717c46acf9797
1.4 MB Preview Download

Additional details

Funding

European Commission
ELIAS - European Lighthouse of AI for Sustainability 101120237