Sciweavers

» Webbed documents
WEBI
2010
Springer
15 years 11 days ago
On Using Query Logs for Static Index Pruning
Static index pruning techniques aim at removing from the posting lists of an inverted file the references to documents which are likely to be not relevant for answering user querie...
Hoang Thanh Lam, Raffaele Perego, Fabrizio Silvest...
CIKM
2011
Springer
14 years 2 months ago
Probabilistic near-duplicate detection using simhash
This paper offers a novel look at using a dimensionality-reduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...
Sadhan Sood, Dmitri Loguinov
CIKM
2011
Springer
14 years 2 months ago
Factorization-based lossless compression of inverted indices
Many large-scale Web applications that require ranked top-k retrieval are implemented using inverted indices. An inverted index represents a sparse term-document matrix, where non...
George Beskales, Marcus Fontoura, Maxim Gurevich, ...
ELPUB
2006
ACM
15 years 8 months ago
Serving Innovation in Scholarly Communication with the Open Platform "Digital Peer Publishing"
The internet causes a continuous emergence of novel forms of scholarly communication and collaboration. Electronic publishing provides a means for representing eventual outcomes o...
Wolfram Horstmann, Peter Reimer, Jochen Schirrwage...
SEMWEB
2005
Springer
15 years 8 months ago
A Bayesian Network Approach to Ontology Mapping
This paper presents our ongoing effort on developing a principled methodology for automatic ontology mapping based on BayesOWL, a probabilistic framework we developed for modeling ...
Rong Pan, Zhongli Ding, Yang Yu, Yun Peng