Published March 5, 2021 | Version v1
Conference paper Open

Optimally Deceiving a Learning Leader in Stackelberg Games

  • 1. Sapienza University of Rome
  • 2. University of Oxford
  • 3. University of Essex

Description

Recent results in the ML community have revealed that learning algorithms used to compute the optimal strategy for the leader to commit to in a Stackelberg game, are susceptible to manipulation by the follower. Such a learning algorithm operates by querying the best responses or the payoffs of the follower, who consequently can deceive the algorithm by responding as if their payoffs were much different than what they actually are. For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the payoffs that would make the learning algorithm compute a commitment so that best responding to it maximizes the follower's utility, according to the true payoffs. While this problem has been considered before, the related literature only focused on the simplified scenario in which the payoff space is finite, thus leaving the general version of the problem unanswered. In this paper, we fill this gap by showing that it is always possible for the follower to efficiently compute (near-)optimal payoffs for various scenarios of learning interaction between the leader and the follower.

Files

2020__NEURIPS__Optimally_Deceiving_a_Learning_Leader_in_Stackelberg_Games.pdf

Additional details

Funding

European Commission
AMDROMA - Algorithmic and Mechanism Design Research in Online MArkets 788893