Search Sciweavers | Sciweavers

6 search results - page 1 / 2

» Evaluation of crawling policies for a web-repository crawler

HT
2006
ACM

92 views, Internet Technology

Evaluation of crawling policies for a web-repository crawler

14 years 4 months ago

Download www.cs.odu.edu

We have developed a web-repository crawler that is used for reconstructing websites when backups are unavailable. Our crawler retrieves web resources from the Internet Archive, Go...

Frank McCown, Michael L. Nelson

WIDM
2006
ACM

95 views, Internet Technology

Lazy preservation: reconstructing websites by crawling the crawlers

14 years 4 months ago

Download www.cs.odu.edu

Backup of websites is often not considered until after a catastrophic event has occurred to either the website or its webmaster. We introduce “lazy preservation” – digital p...

Frank McCown, Joan A. Smith, Michael L. Nelson

CIKM
2011
Springer

259 views, Information Technology

Focusing on novelty: a crawling strategy to build diverse language models

12 years 10 months ago

Download www2.research.att.com

Word prediction performed by language models plays an important role in many tasks, such as word sense disambiguation, speech recognition, hand-writing recognition, query spelling an...

Luciano Barbosa, Srinivas Bangalore

WWW
2008
ACM

103 views, Internet Technology

Low-load server crawler: design and evaluation

14 years 11 months ago

Download www2008.org

This paper proposes a method of crawling Web servers connected to the Internet without imposing a high processing load. We are using the crawler for a field survey of the digital ...

Katsuko T. Nakahira, Tetsuya Hoshino, Yoshiki Mika...

CORR
2012
Springer

292 views, Education

Optimal Threshold Control by the Robots of Web Search Engines with Obsolescence of Documents

12 years 6 months ago

Download www-sop.inria.fr

A typical web search engine consists of three principal parts: crawling engine, indexing engine, and searching engine. The present work aims to optimize the performance of the cra...

Konstantin Avrachenkov, Alexander N. Dudin, Valent...
