Sciweavers

915 search results - page 173 / 183
» Ranking the web frontier
Sort
View
CIKM
2011
Springer
12 years 7 months ago
Factorization-based lossless compression of inverted indices
Many large-scale Web applications that require ranked top-k retrieval are implemented using inverted indices. An inverted index represents a sparse term-document matrix, where non...
George Beskales, Marcus Fontoura, Maxim Gurevich, ...
SIGMOD
2012
ACM
234views Database» more  SIGMOD 2012»
11 years 10 months ago
SOFIA SEARCH: a tool for automating related-work search
When working on a new project, researchers need to devote a significant amount of time and effort to surveying the relevant literature. This is required in order to gain experti...
Behzad Golshan, Theodoros Lappas, Evimaria Terzi
WWW
2006
ACM
14 years 8 months ago
Using annotations in enterprise search
A major difference between corporate intranets and the Internet is that in intranets the barrier for users to create web pages is much higher. This limits the amount and quality o...
Pavel A. Dmitriev, Nadav Eiron, Marcus Fontoura, E...
WWW
2005
ACM
14 years 8 months ago
A personalized search engine based on web-snippet hierarchical clustering
In this paper we propose a hierarchical clustering engine, called SnakeT, that is able to organize on-the-fly the search results drawn from 16 commodity search engines into a hier...
Paolo Ferragina, Antonio Gulli
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 8 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar