Sciweavers

472 search results - page 12 / 95
» Crawling the Hidden Web
ISM
2008
IEEE
14 years 2 months ago
LeeDeo: Web-Crawled Academic Video Search Engine
We present our vision and preliminary design toward web-crawled academic video search engine, named as LeeDeo, that can search, crawl, archive, index, and browse “academic” vi...
Dongwon Lee, Hung-sik Kim, Eun Kyung Kim, Su Yan, ...
ICIW
2009
IEEE
13 years 5 months ago
Utilizing RSS Feeds for Crawling the Web
We present "advaRSS" crawling mechanism which is created in order to support peRSSonal, a mechanism used to create personalized RSS feeds. In contrast to the common crawl...
George Adam, Christos Bouras, Vassilis Poulopoulos
KDD
2002
ACM
14 years 8 months ago
Collaborative crawling: mining user experiences for topical resource discovery
The rapid growth of the world wide web had made the problem of topic specific resource discovery an important one in recent years. In this problem, it is desired to find web pages wh...
Charu C. Aggarwal
WWW
2003
ACM
14 years 8 months ago
Efficient URL caching for world wide web crawling
Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)–(c). Howev...
Andrei Z. Broder, Marc Najork, Janet L. Wiener
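
The basic loop (a)–(c) quoted in the abstract above can be sketched roughly as follows. This is an illustrative sketch only, not the method from the paper: the `crawl` function, the `LinkExtractor` class, the `max_pages` limit, and the in-memory `PAGES` site are all assumptions made for the example, and the "seen" test is a plain Python set rather than any caching scheme.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Hypothetical helper: collects href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch, max_pages=100):
    """Breadth-first crawl from `seed`; `fetch(url)` returns HTML text.

    `fetch` is injected so the sketch runs without network access.
    """
    seen = {seed}             # URLs already discovered
    frontier = deque([seed])  # URLs waiting to be fetched
    visited = []              # fetch order, returned to the caller
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        html = fetch(url)                  # (a) fetch a page
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)                  # (b) extract linked URLs
        for href in parser.links:
            link = urljoin(url, href)      # resolve relative links
            if link not in seen:           # (c) only URLs not seen before
                seen.add(link)
                frontier.append(link)
    return visited


# Assumed toy site standing in for the web.
PAGES = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a> <a href="/">home</a>',
    "http://example.test/b": '<a href="/a">A</a>',
}

order = crawl("http://example.test/", lambda u: PAGES.get(u, ""))
print(order)
```

Each page is fetched once even though the toy pages link to one another, because every discovered URL is checked against `seen` before it is queued.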
IR
2008
13 years 7 months ago
Focused web crawling in the acquisition of comparable corpora
CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...
Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...