Sciweavers

191 search results - page 8 / 39
» An XML-based Document Suite
Sort
View
ICDAR
2009
IEEE
14 years 5 months ago
Metadata Extraction from PDF Papers for Digital Library Ingest
In this paper we analyze our recent research on the use of document analysis techniques for metadata extraction from PDF papers. We describe a package that is designed to extract ...
Simone Marinai
DOCENG
2003
ACM
14 years 4 months ago
Using SVG as the rendering model for structured and graphically complex web material
This paper reports some experiments in using SVG (Scalable Vector Graphics), rather than the browser default of (X)HTML/CSS, as a potential Web-based rendering technology, in an a...
Julius C. Mong, David F. Brailsford
ICDAR
2009
IEEE
14 years 5 months ago
Constant-Time Locally Optimal Adaptive Binarization
Scanned document images are nowadays becoming available in increasingly higher resolutions. Meanwhile, the variations in image quality within typical document collections increase...
Iuliu Konya Konya, Christoph Seibert, Stefan Eicke...
MHCI
2004
Springer
14 years 4 months ago
Automatic Partitioning of Web Pages Using Clustering
This paper introduces a method for automatically partitioning richly-formatted electronic documents. An automatic partitioning system has many potential uses, but we focus here on ...
Richard Romero, Adam Berger
CORR
2010
Springer
145views Education» more  CORR 2010»
13 years 11 months ago
Random Indexing K-tree
Random Indexing K-tree is the combination of two algorithms suited for large scale document clustering. Keywords Random Indexing, K-tree, Dimensionality Reduction, B-tree, Search T...
Christopher M. De Vries, Lance De Vine, Shlomo Gev...