Sciweavers

395 search results - page 59 / 79
» An Automatic Data Grabber for Large Web Sites
Sort
View
WSDM
2012
ACM
236views Data Mining» more  WSDM 2012»
12 years 3 months ago
Effective query formulation with multiple information sources
Most standard information retrieval models use a single source of information (e.g., the retrieval corpus) for query formulation tasks such as term and phrase weighting and query ...
Michael Bendersky, Donald Metzler, W. Bruce Croft
CCGRID
2005
IEEE
14 years 1 months ago
ReGS: user-level reliability in a grid environment
Grid environments are ideal for executing applications that require a huge amount of computational work, both due to the big number of tasks to execute and to the large amount of ...
J. A. L. Sanches, Patrícia Kayser Vargas, I...
PVLDB
2008
124views more  PVLDB 2008»
13 years 7 months ago
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
CIKM
2007
Springer
14 years 1 months ago
Autonomously semantifying wikipedia
Berners-Lee’s compelling vision of a Semantic Web is hindered by a chicken-and-egg problem, which can be best solved by a bootstrapping method — creating enough structured dat...
Fei Wu, Daniel S. Weld
NSDI
2008
13 years 10 months ago
Ostra: Leveraging Trust to Thwart Unwanted Communication
Online communication media such as email, instant messaging, bulletin boards, voice-over-IP, and social networking sites allow any sender to reach potentially millions of users at...
Alan Mislove, Ansley Post, Peter Druschel, P. Kris...