Sciweavers

459 search results - page 45 / 92
» Random sampling from a search engine's index
Sort
View
SPIRE
2010
Springer
13 years 8 months ago
Hypergeometric Language Model and Zipf-Like Scoring Function for Web Document Similarity Retrieval
The retrieval of similar documents in the Web from a given document is different in many aspects from information retrieval based on queries generated by regular search engine use...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti&a...
ICMCS
2009
IEEE
159views Multimedia» more  ICMCS 2009»
13 years 7 months ago
Large-scale near-duplicate web video search: Challenge and opportunity
The massive amount of near-duplicate and duplicate web videos has presented both challenge and opportunity to multimedia computing. On one hand, browsing videos on Internet become...
Wanlei Zhao, Song Tan, Chong-Wah Ngo
FSTTCS
2010
Springer
13 years 7 months ago
Average Analysis of Glushkov Automata under a BST-Like Model
We study the average number of transitions in Glushkov automata built from random regular expressions. This statistic highly depends on the probabilistic distribution set on the e...
Cyril Nicaud, Carine Pivoteau, Benoît Razet
SIGMOD
2004
ACM
184views Database» more  SIGMOD 2004»
14 years 10 months ago
Identifying Similarities, Periodicities and Bursts for Online Search Queries
We present several methods for mining knowledge from the query logs of the MSN search engine. Using the query logs, we build a time series for each query word or phrase (e.g., `Th...
Michail Vlachos, Christopher Meek, Zografoula Vage...
EKAW
2008
Springer
13 years 11 months ago
Principles for Knowledge Engineering on the Web
With the advent of the Web and the efforts towards a Semantic Web the nature of knowledge engineering has changed drastically. In this position paper we propose four principles fo...
Guus Schreiber