Sciweavers

218 search results - page 7 / 44
» Crawling for Images on the WWW
Sort
View
WWW
2003
ACM
14 years 8 months ago
Efficient URL caching for world wide web crawling
Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)?(c). Howev...
Andrei Z. Broder, Marc Najork, Janet L. Wiener
MM
2004
ACM
133views Multimedia» more  MM 2004»
14 years 1 months ago
Intuitive and effective interfaces for WWW image search engines
Web image search engine has become an important tool to organize digital images on the Web. However, most commercial search engines still use a list presentation while little effo...
Zhiwei Li, Xing Xie, Hao Liu, Xiaoou Tang, Mingjin...
MIR
2004
ACM
125views Multimedia» more  MIR 2004»
14 years 1 months ago
Autonomous visual model building based on image crawling through internet search engines
In this paper, we propose an autonomous learning scheme to automatically build visual semantic concept models from the output data of Internet search engines without any manual la...
Xiaodan Song, Ching-Yung Lin, Ming-Ting Sun
WWW
2008
ACM
14 years 8 months ago
IRLbot: scaling to 6 billion pages and beyond
This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...
Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...
SEBD
2008
148views Database» more  SEBD 2008»
13 years 9 months ago
Crawling, Indexing, and Similarity Searching Images on the Web
Michal Batko, Fabrizio Falchi, Claudio Lucchese, D...