Search Sciweavers | Sciweavers

48 search results - page 5 / 10

» Language Based Crawling: Crawling the Arabic Content of the ...

click to vote

WWW
2008
ACM

109views Internet Technology» more WWW 2008»

Recrawl scheduling based on information longevity

14 years 7 months ago

Download www2008.org

It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...

Christopher Olston, Sandeep Pandey

claim paper

Read More »

click to vote

WWW
2009
ACM

128views Internet Technology» more WWW 2009»

Detecting soft errors by redirection classification

14 years 7 months ago

Download www2009.eprints.org

A soft error redirection is a URL redirection to a page that returns the HTTP status code 200 (OK) but has actually no relevant content to the client request. Since such redirecti...

Taehyung Lee, Jinil Kim, Jin Wook Kim, Sung-Ryul K...

claim paper

Read More »

click to vote

WWW
2011
ACM

198views Internet Technology» more WWW 2011»

we.b: the web of short urls

13 years 1 months ago

Download research.microsoft.com

Short URLs have become ubiquitous. Especially popular within social networking services, short URLs have seen a signiﬁcant increase in their usage over the past years, mostly du...

Demetres Antoniades, Iasonas Polakis, Georgios Kon...

claim paper

Read More »

click to vote

WWW
2004
ACM

179views Internet Technology» more WWW 2004»

Combining link and content analysis to estimate semantic similarity

14 years 7 months ago

Download www.informatics.indiana.edu

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic ass...

Filippo Menczer

claim paper

Read More »

click to vote

SAC
2005
ACM

124views Applied Computing» more SAC 2005»

A distributed content-based search engine based on mobile code

14 years 15 days ago

Download semoa.sourceforge.net

Current search engines crawl the Web, download content, and digest this content locally. For multimedia content, this involves considerable volumes of data. Furthermore, this proc...

Volker Roth, Ulrich Pinsdorf, Jan Peters

claim paper

Read More »

« Prev « First page 5 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers