Learning algorithms for online principal-agent problems (and selling goods online)

15 years 1 months ago

Download www.cs.duke.edu

In a principal-agent problem, a principal seeks to motivate an agent to take a certain action beneficial to the principal, while spending as little as possible on the reward. This is complicated by the fact that the principal does not know the agent's utility function (or type). We study the online setting where at each round, the principal encounters a new agent, and the principal sets the rewards anew. At the end of each round, the principal only finds out the action that the agent took, but not his type. The principal must learn how to set the rewards optimally. We show that this setting generalizes the setting of selling a digital good online. We study and experimentally compare three main approaches to this problem. First, we show how to apply a standard bandit algorithm to this setting. Second, for the case where the distribution of agent types is fixed (but unknown to the principal), we introduce a new gradient ascent algorithm. Third, for the case where the distribution o...

Vincent Conitzer, Nikesh Garera

Real-time Traffic

ICML 2006 | Machine Learning | Principal Encounters | Principal Seeks | Principal Sets |

claim paper

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2006
Where	ICML
Authors	Vincent Conitzer, Nikesh Garera

Comments (0)

Sciweavers

Learning algorithms for online principal-agent problems (and selling goods online)

ICML 2006 | Machine Learning | Principal Encounters | Principal Seeks | Principal Sets |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers