Sciweavers

1071 search results - page 132 / 215
» A kernel-based approach to document retrieval
Sort
View
CEAS
2007
Springer
14 years 4 months ago
Hardening Fingerprinting by Context
Near-duplicate detection is not only an important pre and post processing task in Information Retrieval but also an effective spam-detection technique. Among different approache...
Aleksander Kolcz, Abdur Chowdhury
SIGIR
2010
ACM
14 years 1 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
ECIR
2010
Springer
13 years 11 months ago
Explicit Search Result Diversification through Sub-queries
Queries submitted to a retrieval system are often ambiguous. In such a situation, a sensible strategy is to diversify the ranking of results to be retrieved, in the hope that users...
Rodrygo L. T. Santos, Jie Peng, Craig Macdonald, I...
KDD
2007
ACM
122views Data Mining» more  KDD 2007»
14 years 10 months ago
Expertise modeling for matching papers with reviewers
An essential part of an expert-finding task, such as matching reviewers to submitted papers, is the ability to model the expertise of a person based on documents. We evaluate seve...
David M. Mimno, Andrew McCallum
AIRWEB
2009
Springer
14 years 4 months ago
Looking into the past to better classify web spam
Web spamming techniques aim to achieve undeserved rankings in search results. Research has been widely conducted on identifying such spam and neutralizing its influence. However,...
Na Dai, Brian D. Davison, Xiaoguang Qi