Balancing Exploration and Exploitation in Learning to Rank Online

13 years 7 months ago

Download staff.science.uva.nl

Abstract. As retrieval systems become more complex, learning to rank approaches are being developed to automatically tune their parameters. Using online learning to rank approaches, retrieval systems can learn directly from implicit feedback, while they are running. In such an online setting, algorithms need to both explore new solutions to obtain feedback for effective learning, and exploit what has already been learned to produce results that are acceptable to users. We formulate this challenge as an exploration-exploitation dilemma and present the ﬁrst online learning to rank algorithm that works with implicit feedback and balances exploration and exploitation. We leverage existing learning to rank data sets and recently developed click models to evaluate the proposed algorithm. Our results show that ﬁnding a balance between exploration and exploitation can substantially improve online retrieval performance, bringing us one step closer to making online learning to rank work in p...

Katja Hofmann, Shimon Whiteson, Maarten de Rijke

Real-time Traffic

Click Models | ECIR 2011 | Implicit Feedback | Information Technology | Retrieval Performance |

claim paper

Post Info
More Details (n/a)

Added	27 Aug 2011
Updated	27 Aug 2011
Type	Journal
Year	2011
Where	ECIR
Authors	Katja Hofmann, Shimon Whiteson, Maarten de Rijke

Comments (0)

Sciweavers

Balancing Exploration and Exploitation in Learning to Rank Online

Click Models | ECIR 2011 | Implicit Feedback | Information Technology | Retrieval Performance |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers