Sciweavers

295 search results - page 5 / 59
» Web Crawling
Sort
View
WWW
2004
ACM
14 years 9 months ago
Distributed community crawling
The massive distribution of the crawling task can lead to inefficient exploration of the same portion of the Web. We propose a technique to guide crawlers exploration based on the...
Fabrizio Costa, Paolo Frasconi
JUCS
2008
124views more  JUCS 2008»
13 years 8 months ago
Structure-Based Crawling in the Hidden Web
: The number of applications that need to crawl the Web to gather data is growing at an ever increasing pace. In some cases, the criterion to determine what pages must be included ...
Márcio L. A. Vidal, Altigran Soares da Silv...
WEBI
2009
Springer
14 years 3 months ago
Learning Deep Web Crawling with Diverse Features
—The key to Deep Web crawling is to submit promising keywords to query form and retrieve Deep Web content efficiently. To select keywords, existing methods make a decision based ...
Lu Jiang, Zhaohui Wu, Qinghua Zheng, Jun Liu
WIDM
2004
ACM
14 years 2 months ago
Probabilistic models for focused web crawling
A Focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen
SAC
2003
ACM
14 years 1 months ago
Ontology-Focused Crawling of Web Documents
The Web, the largest unstructured database of the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed n...
Marc Ehrig, Alexander Maedche