Sciweavers

1216 search results - page 35 / 244
» Optimizing web search using web click-through data
Sort
View
PAKM
2004
13 years 9 months ago
Automatic Generation of Taxonomies from the WWW
In this paper we present a methodology to extract information from the Web to build a taxonomy of terms and Web resources for a given domain. This taxonomy represents a hierarchy o...
David Sánchez, Antonio Moreno
SIGIR
2008
ACM
13 years 8 months ago
Using parsimonious language models on web data
In this paper we explore the use of parsimonious language models for web retrieval. These models are smaller thus more efficient than the standard language models and are therefor...
Rianne Kaptein, Rongmei Li, Djoerd Hiemstra, Jaap ...
JCDL
2009
ACM
102views Education» more  JCDL 2009»
14 years 3 months ago
Unsupervised creation of small world networks for the preservation of digital objects
The prevailing model for digital preservation is that archives should be similar to a “fortress”: a large, protective infrastructure built to defend a relatively small collect...
Charles L. Cartledge, Michael L. Nelson
ICDE
2012
IEEE
227views Database» more  ICDE 2012»
11 years 11 months ago
Temporal Analytics on Big Data for Web Advertising
—“Big Data” in map-reduce (M-R) clusters is often fundamentally temporal in nature, as are many analytics tasks over such data. For instance, display advertising uses Behavio...
Badrish Chandramouli, Jonathan Goldstein, Songyun ...
WWW
2011
ACM
13 years 3 months ago
Parallel boosted regression trees for web search ranking
Gradient Boosted Regression Trees (GBRT) are the current state-of-the-art learning paradigm for machine learned websearch ranking — a domain notorious for very large data sets. ...
Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal...