Sciweavers

295 search results - page 18 / 59
» Web Crawling
Sort
View
ECIR
2006
Springer
13 years 11 months ago
Efficient Parallel Computation of PageRank
Abstract. PageRank inherently is massively parallelizable and distributable, as a result of web's strict host-based link locality. In this paper we show that the Gau
Christian Kohlschütter, Paul-Alexandru Chirit...
ICML
2007
IEEE
14 years 10 months ago
Focused crawling with scalable ordinal regression solvers
In this paper we propose a novel, scalable, clustering based Ordinal Regression formulation, which is an instance of a Second Order Cone Program (SOCP) with one Second Order Cone ...
Rashmin Babaria, J. Saketha Nath, S. Krishnan, K. ...
ECAI
2008
Springer
13 years 11 months ago
Reinforcement Learning with Classifier Selection for Focused Crawling
Focused crawlers are programs that wander in the Web, using its graph structure, and gather pages that belong to a specific topic. The most critical task in Focused Crawling is the...
Ioannis Partalas, Georgios Paliouras, Ioannis P. V...
INTR
2002
50views more  INTR 2002»
13 years 9 months ago
Methodologies for crawler based Web surveys
There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...
Mike Thelwall
CVPR
2011
IEEE
13 years 6 months ago
Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds
Active learning and crowdsourcing are promising ways to efficiently build up training sets for object recognition, but thus far techniques are tested in artificially controlled ...
Sudheendra Vijayanarasimhan, Kristen Grauman