We use Wikipedia articles to semantically inform the generation of query models. To this end, we apply supervised machine learning to automatically link queries to Wikipedia artic...
We report on the construction of the PAN Wikipedia vandalism corpus, PAN-WVC-10, using Amazon’s Mechanical Turk. The corpus compiles 32 452 edits on 28 468 Wikipedia articles, a...
In this paper, we propose a new application of a Bayesian language model based on the Pitman-Yor process for information retrieval. This model is a generalization of the Dirichlet distr...
We propose a method to predict a user’s favourite locations in a city, based on his Flickr geotags in other cities. We define a similarity between the geotag distributions of t...
Maarten Clements, Pavel Serdyukov, Arjen P. de Vri...
We consider blog feed search: identifying relevant blogs for a given topic. An individual’s search behavior often involves a combination of exploratory behavior triggered by sal...
Wouter Weerkamp, Krisztian Balog, Maarten de Rijke