Sciweavers

» Scalable Feature Extraction from Noisy Documents
ICDAR
2005
IEEE
14 years 3 months ago
A Model for Detecting and Merging Vertically Spanned Table Cells in Plain Text Documents
A spanned cell in a table is a single, complete unit that physically occupies multiple columns and/or multiple rows. Spanned cells are common in tables, and they are a significan...
Vanessa Long, Robert Dale, Steve Cassidy
ICDAR
1997
IEEE
14 years 2 months ago
Document image similarity and equivalence detection
A hierarchical algorithm is presented for determining the similarity and equivalence of document images. Features extracted from the CCITT fax-compressed representations of two im...
Jonathan J. Hull, John F. Cullen
DMIN
2006
13 years 11 months ago
A Comparison of Two Document Clustering Approaches for Clustering Medical Documents
Medical data is often presented as free text in the form of medical reports. Such documents contain important information about patients, disease progression and management, but ar...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
CAIP
2009
Springer
14 years 2 months ago
A Novel Approach for Word Spotting Using Merge-Split Edit Distance
Edit distance matching has been used in the literature for word spotting with characters taken as primitives. The recognition rate, however, is limited by the segmentation inconsistenci...
Khurram Khurshid, Claudie Faure, Nicole Vincent
ICPR
2000
IEEE
14 years 2 months ago
Structure Extraction from Various Kinds of Decorated Characters Using Multi-Scale Images
Decorated characters are widely used in various documents. A practical optical character reader is required to deal with not only common fonts but also complex designed fonts. Howev...
Shinichiro Omachi, Masaki Inoue, Hirotomo Aso