Sciweavers

1507 search results - page 135 / 302
» Archiving scientific data
Sort
View
AUSDM
2006
Springer
97views Data Mining» more  AUSDM 2006»
14 years 1 months ago
Tracking the Changes of Dynamic Web Pages in the Existence of URL Rewriting
Crawlers in a knowledge management system need to collect and archive documents from websites, and also track the change status of these documents. However, the existence of URL r...
Ping-Jer Yeh, Jie-Tsung Li, Shyan-Ming Yuan
INCDM
2010
Springer
125views Data Mining» more  INCDM 2010»
13 years 11 months ago
Web-Site Boundary Detection
Defining the boundaries of a web-site, for (say) archiving or information retrieval purposes, is an important but complicated task. In this paper a web-page clustering approach to...
Ayesh Alshukri, Frans Coenen, Michele Zito
RAID
1999
Springer
14 years 1 months ago
Audit logs: to keep or not to keep?
We approached this line of inquiry by questioning the conventional wisdom that audit logs are too large to be analyzed and must be reduced and filtered before the data can be anal...
Christopher Wee
FAST
2010
13 years 11 months ago
Bimodal Content Defined Chunking for Backup Streams
Data deduplication has become a popular technology for reducing the amount of storage space necessary for backup and archival data. Content defined chunking (CDC) techniques are w...
Erik Kruus, Cristian Ungureanu, Cezary Dubnicki
SIGMOD
2007
ACM
108views Database» more  SIGMOD 2007»
14 years 9 months ago
Provenance in databases
The provenance of data has recently been recognized as central to the trust one places in data. It is also important to annotation, to data integration and to probabilistic databa...
Peter Buneman, Wang Chiew Tan