Sciweavers

251 search results - page 28 / 51
» Document image analysis for digital libraries
ICDAR
2011
IEEE
14 years 3 months ago
Ternary Entropy-Based Binarization of Degraded Document Images Using Morphological Operators
A vast number of historical and badly degraded document images can be found in libraries, public, and national archives. Due to the complex nature of different artifacts, such p...
T. Hoang Ngan Le, Tien D. Bui, Ching Y. Suen
SAMT
2007
Springer
15 years 9 months ago
Document Layout Substructure Discovery
In this paper we present a system, DoLSuD, for the automatic discovery of relevant substructures in a document layout. DoLSuD, Document Layout Substructure Discovery, ext...
Claudio Andreatta
ICDAR
2007
IEEE
15 years 9 months ago
Quantile Linear Algorithm for Robust Binarization of Digitalized Letters
We describe a threshold-based local algorithm for image binarization. The main idea is to compute a transition energy using pixel value differences taken from a neighborhood aroun...
M. Ramírez, Ernesto Tapia, Marco Block, Ra&...
DRR
2009
15 years 1 month ago
Enriching a document collection by integrating information extraction and PDF annotation
Modern digital libraries offer all the hyperlinking possibilities of the World Wide Web: when a reader finds a citation of interest, in many cases she can now click on a link to b...
Brett Powley, Robert Dale, Ilya Anisimoff
DOCENG
2009
ACM
15 years 10 months ago
Test collection management and labeling system
In order to evaluate the performance of information retrieval and extraction algorithms, we need test collections. A test collection consists of a set of documents, a clearly form...
Eunyee Koh, Andruid Kerne, Sarah Berry