Search Sciweavers | Sciweavers

154

ISM
2008
IEEE

127views Multimedia» more ISM 2008»

LeeDeo: Web-Crawled Academic Video Search Engine

16 years 1 months ago

We present our vision and preliminary design toward web-crawled academic video search engine, named as LeeDeo, that can search, crawl, archive, index, and browse “academic” vi...

Dongwon Lee, Hung-sik Kim, Eun Kyung Kim, Su Yan, ...

claim paper

Read More »

185

click to vote

ICIW
2009
IEEE

124views Internet Technology» more ICIW 2009»

Utilizing RSS Feeds for Crawling the Web

15 years 4 months ago

Download ru6.cti.gr

We present "advaRSS" crawling mechanism which is created in order to support peRSSonal, a mechanism used to create personalized RSS feeds. In contrast to the common crawl...

George Adam, Christos Bouras, Vassilis Poulopoulos

claim paper

Read More »

194

Voted

KDD
2002
ACM

115views Data Mining» more KDD 2002»

Collaborative crawling: mining user experiences for topical resource discovery

16 years 7 months ago

Download charuaggarwal.net

The rapid growth of the world wide web had made the problem of topic speci c resource discovery an important one in recent years. In this problem, it is desired to nd web pages wh...

Charu C. Aggarwal

claim paper

Read More »

221

click to vote

WWW
2003
ACM

133views Internet Technology» more WWW 2003»

Efficient URL caching for world wide web crawling

16 years 7 months ago

Download research.microsoft.com

Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)?(c). Howev...

Andrei Z. Broder, Marc Najork, Janet L. Wiener

claim paper

Read More »

233

click to vote

IR
2008

189views Natural Language Processing» more IR 2008»

Focused web crawling in the acquisition of comparable corpora

15 years 7 months ago

Download www.info.uta.fi

CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...

Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers