Sciweavers

563 search results - page 37 / 113
» Crawling the web for structured documents
Sort
View
CAISE
2003
Springer
15 years 9 months ago
From State to Structure: an XML Web Publishing Framework
Abstract. We present the main features of a system designed to support the development and delivery of web applications through concepts for modularity, reuse and rapid prototyping...
Moira C. Norrie, Alexios Palinginis
120
Voted
IADIS
2004
15 years 5 months ago
Structuration and metadata for electronic library
The complexity of preserving the web is becoming one of the most important information and communication media. While the quantity of digital resources available through the web i...
Omar Larouk, Salah Dalhoumi
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
16 years 4 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
128
Voted
WWW
2003
ACM
16 years 4 months ago
Detecting web page structure for adaptive viewing on small form factor devices
Mobile devices have already been widely used to access the Web. However, because most available web pages are designed for desktop PC in mind, it is inconvenient to browse these l...
Yu Chen, Wei-Ying Ma, HongJiang Zhang
SAC
2000
ACM
15 years 8 months ago
A Synchronization Model for Hypermedia Documents Navigation
This paper presents a model for describing the synchronization between several media delivered over a network in a Web-based environment. Synchronization concerns the download and...
Augusto Celentano, Ombretta Gaggi