Sciweavers

459 search results - page 28 / 92
» Random sampling from a search engine's index
Sort
View
JASIS
2000
100views more  JASIS 2000»
13 years 8 months ago
Raising reliability of web search tool research through replication and chaos theory
: Because the World Wide Web is a dynamic collection of information, the Web search tools (or "search engines") that index the Web are dynamic. Traditional information re...
Scott Nicholson
IC
2003
13 years 9 months ago
An Analysis of Web Documents Retrieved and Viewed
The placement of Websites in ranked retrieval and the viewing patterns of Web search engine users is a crucial issue for Web site owners and Web search engines. However, little la...
Bernard J. Jansen, Amanda Spink
WWW
2006
ACM
14 years 2 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
SIGMOD
2000
ACM
141views Database» more  SIGMOD 2000»
14 years 25 days ago
Counting, Enumerating, and Sampling of Execution Plans in a Cost-Based Query Optimizer
Testing an SQL database system by running large sets of deterministic or stochastic SQL statements is common practice in commercial database development. However, code defects oft...
Florian Waas, César A. Galindo-Legaria
PAMI
2008
270views more  PAMI 2008»
13 years 8 months ago
Randomized Clustering Forests for Image Classification
This paper introduces three new contributions to the problems of image classification and image search. First, we propose a new image patch quantization algorithm. Other competitiv...
Frank Moosmann, Eric Nowak, Frédéric...