Sciweavers

218 search results - page 5 / 44
» Crawling for Images on the WWW
Sort
View
WWW
2006
ACM
14 years 1 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
WWW
2009
ACM
14 years 8 months ago
Crawling English-Japanese person-name transliterations from the web
Automatic compilation of lexicon is a dream of lexicon compilers as well as lexicon users. This paper proposes a system that crawls English-Japanese person-name transliterations f...
Satoshi Sato
WWW
2001
ACM
14 years 8 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina
MICCAI
2004
Springer
14 years 8 months ago
Crawling on the Heart: A Mobile Robotic Device for Minimally Invasive Cardiac Interventions
Nicholas A. Patronik, Marco A. Zenati, Cameron N. ...
WWW
2002
ACM
14 years 8 months ago
Accelerated focused crawling through online relevance feedback
The organization of HTML into a tag tree structure, which is rendered by browsers as roughly rectangular regions with embedded text and HREF links, greatly helps surfers locate an...
Soumen Chakrabarti, Kunal Punera, Mallela Subraman...