Sciweavers

218 search results - page 9 / 44
» Crawling for Images on the WWW
Sort
View
WWW
2008
ACM
14 years 8 months ago
Incremental web page template detection
Most template detection methods process web pages in batches that a newly crawled page can not be processed until enough pages have been collected. This results in large storage c...
Yu Wang, Binxing Fang, Xueqi Cheng, Li Guo, Hongbo...
WWW
2003
ACM
14 years 8 months ago
Automatic Profile Generation in eRACE
In this paper, we describe the design of a profile generator toolkit, which aims to automatically create realistic user profiles for a mobile personalized portal service. These pr...
Christiana Christophi, Marios D. Dikaiakos
WWW
2002
ACM
14 years 8 months ago
Parallel crawlers
In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...
Junghoo Cho, Hector Garcia-Molina
MVA
1998
134views Computer Vision» more  MVA 1998»
13 years 9 months ago
Orientation and Scale Invariant Text Region Extraction in WWW Images
Text extraction from a web image is important for web indexing because the text can contain a key information of the web. This paper presents a method to detect a text with variou...
Taehoon Park, Dongsung Kim, Kyusik Chung
APWEB
2005
Springer
14 years 1 months ago
Indexing Text and Visual Features for WWW Images
In this paper, we present a novel indexing technique called Multi-scale Similarity Indexing (MSI) to index image’s multi-features into a single one-dimensional structure. Both f...
Heng Tao Shen, Xiaofang Zhou, Bin Cui