Sciweavers

843 search results - page 22 / 169
» Segmentation of Compressed Documents
Sort
View
JCP
2007
101views more  JCP 2007»
13 years 7 months ago
Schema-Based Compression of XML Data with Relax NG
Abstract— The extensible markup language XML has become indispensable in many areas, but a significant disadvantage is its size: tagging a set of data increases the space needed...
Christopher League, Kenjone Eng
ICDAR
2007
IEEE
14 years 2 months ago
An Efficient Word Segmentation Technique for Historical and Degraded Machine-Printed Documents
Word segmentation is a crucial step for segmentation-free document analysis systems and is used for creating an index based on word matching. In this paper, we propose a novel met...
Michael Makridis, N. Nikolaou, Basilios Gatos
ICDAR
1995
IEEE
13 years 11 months ago
Ground-truthing and benchmarking document page segmentation
We describe a new approach for evaluating page segmentation algorithms. Unlike techniques that rely on OCR output, our method is region-based: the segmentation output, described a...
Berrin A. Yanikoglu, Luc Vincent
ICIP
2001
IEEE
14 years 9 months ago
Word shape recognition for image-based document retrieval
In this paper, we propose a word shape recognition method for retrieving image-based documents. Document images are segmented at the word level first. Then the proposed method det...
Weihua Huang, Chew Lim Tan, Sam Yuan Sung, Yi Xu
LREC
2008
70views Education» more  LREC 2008»
13 years 9 months ago
An Approach to Modeling Heterogeneous Resources for Information Extraction
In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better under...
Lei Xia, José Iria