Sciweavers

» Text extraction from graphical document images using sparse ...
DMIN
2006
Data Mining
13 years 9 months ago
Effect of Document Representation on the Performance of Medical Document Classification
Text classification in the medical domain is a real world problem with wide applicability. This paper investigates extensively the effect of text representation approaches on the p...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
DOCENG
2009
ACM
14 years 2 months ago
Web document text and images extraction using DOM analysis and natural language processing
Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing. Parag Mulendra Joshi, Sam Liu. HP Laboratories, HPL-2009-187. Web page text extraction,...
Parag Mulendra Joshi, Sam Liu
ICPR
2000
IEEE
14 years 8 months ago
Extraction of Relevant Information from Document Images Using Measures of Visual Attention
This paper describes an approach to attention-based layout segmentation using general principles of human visual perception to achieve this goal. The text is considered as tex...
Gerd Maderlechner, Angela Schreyer, Peter Suda
PRICAI
2000
Springer
13 years 11 months ago
Text Retrieval from Document Images based on N-Gram Algorithm
In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm. We directly extract image features instead of us...
Chew Lim Tan, Sam Yuan Sung, Zhaohui Yu, Yi Xu
ICDAR
1997
IEEE
13 years 12 months ago
Representing OCRed documents in HTML
OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari