Sciweavers

» Webbed documents
DOCENG
2009
ACM
14 years 4 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
CORR
2010
Springer
13 years 9 months ago
Vcache: Caching Dynamic Documents
The traditional web caching is currently limited to static documents only. A page generated on the fly from a server side script may have different contents on different acces...
Vipul Goyal, Sugata Sanyal, Dharma P. Agrawal
WWW
2005
ACM
14 years 10 months ago
Making RDF presentable: integrated global and local semantic Web browsing
This paper discusses generating document structure from annotated media repositories in a domain-independent manner. This approaches the vision of a universal RDF browser. We star...
Lloyd Rutledge, Jacco van Ossenbruggen, Lynda Hard...
CIKM
2009
Springer
14 years 4 months ago
Compact full-text indexing of versioned document collections
We study the problem of creating highly compressed full-text index structures for versioned document collections, that is, collections that contain multiple versions of each docume...
Jinru He, Hao Yan, Torsten Suel
DOCENG
2008
ACM
13 years 12 months ago
Identifying and expanding titles in web texts
In this paper, we present an analysis based on linguistic and typographic features that allows for the identification of titles in web documents. We focus in particular on procedu...
Clémentine Adam, Estelle Delpech, Patrick S...