Sciweavers

550 search results - page 99 / 110
» A general magnitude-preserving boosting algorithm for search...
Sort
View
SIGMOD
2007
ACM
111views Database» more  SIGMOD 2007»
14 years 7 months ago
Query relaxation using malleable schemas
In contrast to classical databases and IR systems, real-world information systems have to deal increasingly with very vague and diverse structures for information management and s...
Xuan Zhou, Julien Gaugaz, Wolf-Tilo Balke, Wolfgan...
WWW
2010
ACM
14 years 2 months ago
Identifying spam link generators for monitoring emerging web spam
In this paper, we address the question of how we can identify hosts that will generate links to web spam. Detecting such spam link generators is important because almost all new s...
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregaw...
CIKM
2009
Springer
14 years 2 months ago
Improving retrievability of patents with cluster-based pseudo-relevance feedback documents selection
High findability of documents within a certain cut-off rank is considered an important factor in recall-oriented application domains such as patent or legal document retrieval. ...
Shariq Bashir, Andreas Rauber
CIKM
2007
Springer
14 years 1 months ago
"More like these": growing entity classes from seeds
We present a corpus-based approach to the class expansion task. For a given set of seed entities we use co-occurrence statistics taken from a text collection to define a membersh...
Luís Sarmento, Valentin Jijkoun, Maarten de...
CIKM
2005
Springer
14 years 1 months ago
Novelty detection based on sentence level patterns
The detection of new information in a document stream is an important component of many potential applications. In this paper, a new novelty detection approach based on the identi...
Xiaoyan Li, W. Bruce Croft