Sciweavers

Crawling on web graphs
WWW
2007
ACM
16 years 4 months ago
First-order focused crawling
This paper reports a new general framework of focused web crawling based on "relational subgroup discovery". Predicates are used explicitly to represent the relevance cl...
Qingyang Xu, Wanli Zuo
WWW
2010
ACM
15 years 11 months ago
RESTler: crawling RESTful services
Service descriptions allow designers to document, understand, and use services, creating new useful and complex services with aggregated business value. Unlike RPC-based services,...
Rosa Alarcón, Erik Wilde
IEEECIT
2007
IEEE
15 years 10 months ago
SiteRank-Based Crawling Ordering Strategy for Search Engines
Search engines are playing an increasingly important role in discovering information. Due to limitations of time, network bandwidth, and hardware, we cannot obt...
Qiancheng Jiang, Yan Zhang
PDP
2008
IEEE
15 years 10 months ago
Bulk-Synchronous On-Line Crawling on Clusters of Computers
This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...
Mauricio Marín, Carolina Bonacic
CIKM
2011
Springer
14 years 3 months ago
Focusing on novelty: a crawling strategy to build diverse language models
Word prediction performed by language models has an important role in many tasks, e.g. word sense disambiguation, speech recognition, handwriting recognition, query spelling an...
Luciano Barbosa, Srinivas Bangalore