Sciweavers

ICDAR
2007
IEEE
14 years 2 months ago
Example-Based Logical Labeling of Document Title Page Images
This paper presents a flexible and effective example-based approach for labeling title pages which can be used for automated extraction of bibliographic data. The labels of intere...
Joost van Beusekom, Daniel Keysers, Faisal Shafait...
ICDAR
2007
IEEE
14 years 2 months ago
Automatic Ground-truth Generation for Document Image Analysis and Understanding
Performance evaluation for document image analysis and understanding is a recurring problem. Many ground-truthed document image databases are now used to evaluate general algorithm...
Pierre Héroux, Eugen Barbu, Sébastie...
DOCENG
2008
ACM
14 years 2 months ago
Matching XML documents in highly dynamic applications
Adrovane M. Kade, Carlos A. Heuser
DOCENG
2008
ACM
14 years 2 months ago
Configurable editing of XML-based variable-data documents
Variable data documents can be considered as functions of their bindings to values, and this function could be arbitrarily complex to build strongly-customised but high-value doc...
John Lumley, Roger Gimson, Owen Rees
DOCENG
2008
ACM
14 years 2 months ago
Similarity of XML schema definitions
In this paper we propose a technique for evaluating similarity of XML Schema fragments. Firstly, we define classes of structurally and semantically equivalent XSD constructs. Then...
Irena Mlýnková
DOCENG
2008
ACM
14 years 2 months ago
Interactive office documents: a new face for web 2.0 applications
As the world wide web transforms from a vehicle of information dissemination and e-commerce transactions into a writable nexus of human collaboration, the Web 2.0 technologies at ...
John M. Boyer
DOCENG
2008
ACM
14 years 2 months ago
Malan: a mapping language for the data manipulation
Malan is a MApping LANguage that allows the generation of transformation programs by specifying a schema mapping between a source and target data schema. By working at the schema ...
Arnaud Blouin, Olivier Beaudoux, Stéphane L...
DOCENG
2008
ACM
14 years 2 months ago
Merging changes in XML documents using reliable context fingerprints
Different dialects of XML have emerged as ubiquitous document exchange formats. For effective collaboration based on such documents, the capability to propagate edit operations pe...
Sebastian Rönnau, Christian Pauli, Uwe M. Bor...
DOCENG
2008
ACM
14 years 2 months ago
Keeping a digital library clean: new solutions to old problems
Digital Libraries are complex information systems that involve rich sets of digital objects and their respective metadata, along with multiple organizational structures and servic...
Alberto H. F. Laender, Marcos André Gonç...
DOCENG
2008
ACM
14 years 2 months ago
Identifying and expanding titles in web texts
In this paper, we present an analysis based on linguistic and typographic features that allows for the identification of titles in web documents. We focus in particular on procedu...
Clémentine Adam, Estelle Delpech, Patrick S...