Sciweavers

459 search results - page 8 / 92
» Random sampling from a search engine's index
Sort
View
JCDL
2011
ACM
225views Education» more  JCDL 2011»
12 years 11 months ago
How much of the web is archived?
The Memento Project’s archive access additions to HTTP have enabled development of new web archive access user interfaces. After experiencing this web time travel, the inevitabl...
Scott Ainsworth, Ahmed Alsum, Hany SalahEldeen, Mi...
ECML
2003
Springer
14 years 1 months ago
Optimising Performance of Competing Search Engines in Heterogeneous Web Environments
Abstract. Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-specific search engines provide search services, and metasearchers...
Rinat Khoussainov, Nicholas Kushmerick
FLAIRS
2007
13 years 10 months ago
Indexing Documents by Discourse and Semantic Contents from Automatic Annotations of Texts
The basic aim of the model proposed here is to automatically build semantic metatext structure for texts that would allow us to search and extract discourse and semantic informati...
Brahim Djioua, Jean-Pierre Desclés
WWW
2010
ACM
14 years 3 months ago
Sampling high-quality clicks from noisy click data
Click data captures many users’ document preferences for a query and has been shown to help significantly improve search engine ranking. However, most click data is noisy and of...
Adish Singla, Ryen W. White
ICDE
2008
IEEE
117views Database» more  ICDE 2008»
14 years 9 months ago
Mining Search-Phrase Definitions from Item Descriptions
In this paper, we develop a model for representing term dependence based on Markov Random Fields and present an approach based on Markov Chain Monte Carlo technique for generating ...
Hung V. Nguyen, Hasan Davulcu