Sciweavers

» Extracting semantic structure of web documents using content...
DOCENG
2004
ACM
14 years 2 months ago
The lifecycle of a digital historical document: structure and content
This paper describes the lifecycle of a digital historical document, from template-based structure definition through to content extraction from the scanned pages and its final re...
Apostolos Antonacopoulos, Dimosthenis Karatzas, He...
DOCENG
2008
ACM
13 years 10 months ago
A concise XML binding framework facilitates practical object-oriented document engineering
Semantic web researchers tend to assume that XML Schema and OWL-S are the correct means for representing the types, structure, and semantics of XML data used for documents and int...
Andruid Kerne, Zachary O. Toups, Blake Dworaczyk, ...
DL
2000
Springer
14 years 1 month ago
Extracting and visualizing semantic structures in retrieval results for browsing
The paper introduces an approach that organizes retrieval results semantically and displays them spatially for browsing. Latent Semantic Analysis as well as cluster techniques are...
Katy Börner
AAAI
2008
13 years 11 months ago
Extracting Relevant Snippets for Web Navigation
Search engines present fixed-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extr...
Qing Li, K. Selçuk Candan, Qi Yan
WWW
2009
ACM
14 years 9 months ago
Estimating web site readability using content extraction
Nowadays, information is primarily searched on the WWW. From a user perspective, the readability is an important criterion for measuring the accessibility and thereby the quality ...
Thomas Gottron, Ludger Martin