In information retrieval, the relevance of documents with respect to queries is usually judged by humans, and the judgments are used in evaluation and/or learning of ranking functions. Previous work has shown that a certain level of noise in relevance judgments has little effect on evaluation, especially for comparison purposes. Recently, learning to rank has become one of the major means of creating ranking models, in which models are automatically learned from data derived from a large number of relevance judgments. As far as we know, there has been no previous work on the quality of training data for learning to rank, and this paper studies the issue. Specifically, we address three problems. First, we show that the quality of training data labeled by humans has a critical impact on the performance of learning-to-rank algorithms. Second, we propose detecting relevance judgment errors using click-through data accumulated at a search engine. Two discriminative models, referred to as sequential depende...
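
As a rough illustration of the general idea of detecting relevance judgment errors from click-through data (a minimal sketch only; the feature names and data below are hypothetical, and a plain logistic-regression classifier stands in for the discriminative models proposed in the paper, whose description is truncated above):

```python
# Minimal sketch: flag human relevance labels that a click-based
# discriminative model strongly disagrees with. A real pipeline would use
# held-out predictions (e.g., cross-validation) rather than in-sample fits.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical aggregated click-through features per (query, document) pair:
# click-through rate, skip rate, mean dwell time in seconds.
X = np.array([
    [0.62, 0.10, 41.0],   # frequently clicked, long dwell time
    [0.05, 0.71, 3.0],    # usually skipped
    [0.48, 0.20, 28.0],
    [0.02, 0.80, 2.0],    # skipped, yet labeled relevant below
])
# Human relevance judgments (1 = relevant, 0 = not relevant).
human_labels = np.array([1, 0, 1, 1])

# Train a discriminative model to predict relevance from click behaviour.
model = LogisticRegression().fit(X, human_labels)
predicted = model.predict_proba(X)[:, 1]

# Pairs where the click-based prediction strongly disagrees with the human
# label are candidate judgment errors to re-examine or correct.
suspects = np.where(np.abs(predicted - human_labels) > 0.5)[0]
print("candidate judgment errors at indices:", suspects)
```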