Sciweavers

604 search results - page 73 / 121
» Segmentation of legal documents
Sort
View
DRR
2009
13 years 6 months ago
Text-image alignment for historical handwritten documents
We describe our work on text-image alignment in context of building a historical document retrieval system. We aim at aligning images of words in handwritten lines with their text...
Svitlana Zinger, John Nerbonne, Lambert Schomaker
ICIP
2002
IEEE
14 years 10 months ago
JPEG2000-matched MRC compression of compound documents
The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multilayer decomposition model for compound documents into two contone image layers and a binar...
Debargha Mukherjee, Christos Chrysafis, Amir Said
ICDAR
2009
IEEE
13 years 6 months ago
Document Content Extraction Using Automatically Discovered Features
We report an automatic feature discovery method that achieves results comparable to a manually chosen, larger feature set on a document image content extraction problem: the locat...
Sui-Yu Wang, Henry S. Baird, Chang An
WWW
2006
ACM
14 years 9 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
ICDAR
2009
IEEE
14 years 3 months ago
Classifying Foreground Pixels in Document Images
We present a system that classifies pixels in a document image according to marking type such as machine print, handwriting, and noise. A segmenter module first splits an input ...
Prateek Sarkar, Eric Saund, Jing Lin