Search Sciweavers | Sciweavers

CIDR
2011

243 views, Algorithms

Longitudinal Analytics on Web Archive Data: It's About Time!

12 years 11 months ago

Organizations like the Internet Archive have been capturing Web contents over decades, building up huge repositories of time-versioned pages. The timestamp annotations and the she...

Gerhard Weikum, Nikos Ntarmos, Marc Spaniol, Peter...

ELPUB
1998
ACM

97 views, Information Technology

Research Information Take Away or How to Serve Research Information Fast and Friendly on the Web

13 years 11 months ago

Download elpub.scix.net

In 1997 the library department at the University of Karlskrona/Ronneby was asked to develop a database which could be used to collate and present all the research material and ong...

Peter Linde, Leif Lagebrand

JCDL
2011
ACM

301 views, Education

Archiving the web using page changes patterns: a case study

12 years 10 months ago

Download www-poleia.lip6.fr

A pattern is a model or a template used to summarize and describe the behavior (or the trend) of a data having generally some recurrent events. Patterns have received a considerab...

Myriam Ben Saad, Stéphane Gançarski

SIGMETRICS
2000
ACM

117 views, Hardware

Crawler-Friendly Web Servers

13 years 7 months ago

Download oak.cs.ucla.edu

In this paper we study how to make web servers (e.g., Apache) more crawler friendly. Current web servers offer the same interface to crawlers and regular web surfers, even though ...

Onn Brandman, Junghoo Cho, Hector Garcia-Molina, N...

CIKM
2009
Springer

140 views, Information Technology

Compact full-text indexing of versioned document collections

14 years 2 months ago

Download cis.poly.edu

We study the problem of creating highly compressed full-text index structures for versioned document collections, that is, collections that contain multiple versions of each docume...

Jinru He, Hao Yan, Torsten Suel
