Learning in Online Principal-Agent Interactions: The Power of Menus

Minbiao Han; Michael Albert; Haifeng Xu

doi:10.1609/aaai.v38i16.29691

Authors

Minbiao Han Department of Computer Science, The University of Chicago
Michael Albert Darden Business School, University of Virginia
Haifeng Xu Department of Computer Science, The University of Chicago

DOI:

https://doi.org/10.1609/aaai.v38i16.29691

Keywords:

MAS: Multiagent Learning, GTEP: Imperfect Information, MAS: Modeling other Agents

Abstract

We study a ubiquitous learning challenge in online principal-agent problems during which the principal learns the agent's private information from the agent's revealed preferences in historical interactions. This paradigm includes important special cases such as pricing and contract design, which have been widely studied in recent literature. However, existing work considers the case where the principal can only choose a single strategy at every round to interact with the agent and then observe the agent's revealed preference through their actions. In this paper, we extend this line of study to allow the principal to offer a menu of strategies to the agent and learn additionally from observing the agent's selection from the menu. We provide a thorough investigation of several online principal-agent problem settings and characterize their sample complexities, accompanied by the corresponding algorithms we have developed. We instantiate this paradigm to several important design problems — including Stackelberg (security) games, contract design, and information design. Finally, we also explore the connection between our findings and existing results about online learning in Stackelberg games, and we offer a solution that can overcome a key hard instance of previous work.

Learning in Online Principal-Agent Interactions: The Power of Menus

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription