Sciweavers

DOCENG
2005
ACM
14 years 2 months ago
Managing syntactic variation in text retrieval
Information Retrieval systems are limited by the linguistic variation of language. The use of Natural Language Processing techniques to manage this problem has been studied for a ...
Jesús Vilares, Carlos Gómez-Rodrí...
DOCENG
2005
ACM
14 years 2 months ago
Generative semantic clustering in spatial hypertext
This paper presents an iterative method for generative semantic clustering of related information elements in spatial hypertext documents. The goal is to automatically organize th...
Andruid Kerne, Eunyee Koh, Vikram Sundaram, J. Mic...
DOCENG
2005
ACM
14 years 2 months ago
Enhancing composite digital documents using XML-based standoff markup
Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging from abstract structure to detailed rendering and layout. We pres...
Peter L. Thomas, David F. Brailsford
DOCENG
2005
ACM
14 years 2 months ago
A web-based document harmonization and annotation chain: from PDF to RDF
Thierry Jacquin, Olivier Fambon, Boris Chidlovskii
DOCENG
2005
ACM
14 years 2 months ago
Prefiltering techniques for efficient XML document processing
Chia-Hsin Huang, Tyng-Ruey Chuang, Hahn-Ming Lee
DOCENG
2005
ACM
14 years 2 months ago
Support for arbitrary regions in XSL-FO
Ana Cristina Benso da Silva, João Batista S...
DOCENG
2005
ACM
14 years 2 months ago
Injecting information into atomic units of text
This paper presents a new approach to text processing, based on textemes. These are atomic text units generalising the concepts of character and glyph by merging them in a common ...
Yannis Haralambous, Gábor Bella
DOCENG
2005
ACM
14 years 2 months ago
Compiling XPath for streaming access policy
We show how the full XPath language can be compiled into a minimal subset suited for stream-based evaluation. Specifically, we show how XPath normalization into a core language a...
Pierre Genevès, Kristoffer Høgsbro R...
DOCENG
2005
ACM
14 years 2 months ago
Towards active web clients
Recent developments in document technologies have strongly impacted the evolution of Web clients over the last fifteen years, but not all Web clients have taken the same advantag...
Vincent Quint, Irène Vatton
DOCENG
2005
ACM
14 years 2 months ago
Structuring documents according to their table of contents
In this paper, we present a method for structuring a document according to the information present in its Table of Contents. The detection of the ToC as well as the determination ...
Hervé Déjean, Jean-Luc Meunier