Sciweavers

311 search results - page 9 / 63
» XTRACT: A System for Extracting Document Type Descriptors fr...
Sort
View
DOCENG
2009
ACM
14 years 2 months ago
From rhetorical structures to document structure: shallow pragmatic analysis for document engineering
In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric ...
Gersende Georg, Hugo Hernault, Marc Cavazza, Helmu...
WWW
2003
ACM
14 years 8 months ago
The XML web: a first study
Although originally designed for large-scale electronic publishing, XML plays an increasingly important role in the exchange of data on the Web. In fact, it is expected that XML w...
Laurent Mignet, Denilson Barbosa, Pierangelo Veltr...
WWW
2003
ACM
14 years 8 months ago
Content extraction signatures using XML digital signatures and custom transforms on-demand
Content Extraction Signatures (CES) enable selective disclosure of verifiable content, provide privacy for blinded content, and enable the signer to specify the content the docume...
David Squire, Laurence Bull, Peter Stañski
WWW
2005
ACM
14 years 8 months ago
Interactive web-wrapper construction for extracting relational information from web documents
In this paper, we propose a new user interface to interactively specify Web wrappers to extract relational information from Web documents. In this study, we focused on improving u...
Tsuyoshi Sugibuchi, Yuzuru Tanaka
RIAO
2007
13 years 9 months ago
Using a Content-and-Structure Oriented Method for Relevance Feedback in XML Retrieval
As opposed to traditional Information Retrieval (IR) which views whole documents as atomic units of retrieval, XML IR processes XML elements as possible units of retrieval. Many o...
Lobna Hlaoua, Mohand Boughanem, Karen Pinel-Sauvag...