Sciweavers

472 search results - page 18 / 95
» Crawling the Hidden Web
Sort
View
WWW
2007
ACM
14 years 8 months ago
First-order focused crawling
This paper reports a new general framework of focused web crawling based on "relational subgroup discovery". Predicates are used explicitly to represent the relevance cl...
Qingyang Xu, Wanli Zuo
WWW
2010
ACM
14 years 2 months ago
RESTler: crawling RESTful services
Service descriptions allow designers to document, understand, and use services, creating new useful and complex services with aggregated business value. Unlike RPC-based services,...
Rosa Alarcón, Erik Wilde
ECAI
2008
Springer
13 years 9 months ago
Reinforcement Learning with Classifier Selection for Focused Crawling
Focused crawlers are programs that wander in the Web, using its graph structure, and gather pages that belong to a specific topic. The most critical task in Focused Crawling is the...
Ioannis Partalas, Georgios Paliouras, Ioannis P. V...
IEEECIT
2007
IEEE
14 years 1 months ago
SiteRank-Based Crawling Ordering Strategy for Search Engines
Search engines are playing a more and more important role in discovering information nowadays. Due to limitations of time-consuming, network bandwidth and hardwares, we cannot obt...
Qiancheng Jiang, Yan Zhang
PDP
2008
IEEE
14 years 2 months ago
Bulk-Synchronous On-Line Crawling on Clusters of Computers
This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...
Mauricio Marín, Carolina Bonacic