Sciweavers

611 search results - page 14 / 123
» Random web crawls
HICSS
1999
IEEE
14 years 18 hours ago
Collaborative Web Crawling: Information Gathering/Processing over Internet
The main objective of the IBM Grand Central Station (GCS) is to gather information in virtually any format (text, data, image, graphics, audio, video) from cyberspace...
Shang-Hua Teng, Qi Lu, Matthias Eichstaedt, Daniel...
WWW
2009
ACM
14 years 8 months ago
Crawling English-Japanese person-name transliterations from the web
Automatic compilation of a lexicon is a dream of lexicon compilers as well as lexicon users. This paper proposes a system that crawls English-Japanese person-name transliterations f...
Satoshi Sato
VLDB
2004
ACM
14 years 1 month ago
Accurate and Efficient Crawling for Relevant Websites
Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there ar...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...
ICCSA
2007
Springer
14 years 1 month ago
Crawling the Content Hidden Behind Web Forms
The crawler engines of today cannot reach most of the information contained in the Web. A great amount of valuable information is “hidden” behind the query forms of online data...
Manuel Álvarez, Juan Raposo, Alberto Pan, F...
ERCIMDL
2005
Springer
14 years 1 month ago
Focused Crawling Using Latent Semantic Indexing - An Application for Vertical Search Engines
Vertical search engines and web portals are gaining ground over the general-purpose engines due to their limited size and their high precision for the domain they cover. The number...
George Almpanidis, Constantine Kotropoulos, Ioanni...