Sciweavers

SIGIR
2009
ACM

Smoothing clickthrough data for web search ranking

14 years 7 months ago
Smoothing clickthrough data for web search ranking
Incorporating features extracted from clickthrough data (called clickthrough features) has been demonstrated to significantly improve the performance of ranking models for Web search applications. Such benefits, however, are severely limited by the data sparseness problem, i.e., many queries and documents have no or very few clicks. The ranker thus cannot rely strongly on clickthrough features for document ranking. This paper presents two smoothing methods to expand clickthrough data: query clustering via Random Walk on click graphs and a discounting method inspired by the Good-Turing estimator. Both methods are evaluated on real-world data in three Web search domains. Experimental results show that the ranking models trained on smoothed clickthrough features consistently outperform those trained on unsmoothed features. This study demonstrates both the importance and the benefits of dealing with the sparseness problem in clickthrough data. Categories and Subject Descriptors H.3.3 [Inf...
Jianfeng Gao, Wei Yuan, Xiao Li, Kefeng Deng, Jian
Added 28 May 2010
Updated 28 May 2010
Type Conference
Year 2009
Where SIGIR
Authors Jianfeng Gao, Wei Yuan, Xiao Li, Kefeng Deng, Jian-Yun Nie
Comments (0)