Sciweavers

WWW
2008
ACM

Collaborative filtering on skewed datasets

Many real-life datasets have skewed distributions of events, in which the probability of observing a few events far exceeds that of the others. In this paper, we observe that on skewed datasets the state-of-the-art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click-stream dataset, which is naturally skewed. The same conclusion holds even for the popular movie rating dataset when we pose a binary prediction problem: whether or not a user will give the maximum rating to a movie.
Categories and Subject Descriptors H.3.3 [Information Search and Retrieval]: Information Filtering
General Terms Algorithms, Experimentation
Keywords Collaborative filtering, skewed dataset, pLSA
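The binary prediction problem mentioned in the abstract can be illustrated with a minimal sketch. It turns rating triples into labels (1 if the user gave the maximum rating, 0 otherwise), measures how skewed the labels are, and computes a smoothed per-item probability of the positive label. Everything here is an assumption made for illustration: the function names, the 5-point rating scale, the smoothing weight `alpha`, and the toy data are hypothetical, and the per-item estimate is a generic baseline, not necessarily the simple probabilistic model evaluated in the paper.

```python
from collections import Counter


def binarize_max_rating(ratings, max_rating=5):
    """Map (user, item, rating) triples to (user, item, label).

    label is 1 when the rating equals max_rating, else 0.
    max_rating=5 is an assumed scale, not taken from the paper.
    """
    return [(u, i, 1 if r == max_rating else 0) for u, i, r in ratings]


def positive_rate(labeled):
    """Fraction of positive labels; a small value indicates a skewed dataset."""
    return sum(label for _, _, label in labeled) / len(labeled)


def item_positive_probability(labeled, alpha=1.0):
    """Smoothed estimate of P(label = 1 | item).

    Each item's estimate is pulled toward the global positive rate with
    weight alpha, so items with few observations stay close to the prior.
    This is an illustrative baseline only.
    """
    prior = positive_rate(labeled)
    positives = Counter()
    totals = Counter()
    for _, item, label in labeled:
        totals[item] += 1
        positives[item] += label
    return {
        item: (positives[item] + alpha * prior) / (totals[item] + alpha)
        for item in totals
    }


# Made-up toy ratings, used only to exercise the functions.
ratings = [
    ("u1", "m1", 5), ("u1", "m2", 3),
    ("u2", "m1", 5), ("u2", "m3", 2),
    ("u3", "m2", 4), ("u3", "m3", 1),
    ("u4", "m1", 4), ("u4", "m2", 2),
]

labeled = binarize_max_rating(ratings)
rate = positive_rate(labeled)
probs = item_positive_probability(labeled)
```

On this toy data only 2 of the 8 labels are positive, so a predictor has to be judged on how well it ranks the rare positive events rather than on raw accuracy.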
Somnath Banerjee, Krishnan Ramanathan
Added 21 Nov 2009
Updated 21 Nov 2009
Type Conference
Year 2008
Where WWW
Authors Somnath Banerjee, Krishnan Ramanathan