Sciweavers

10500 search results - page 142 / 2100
» Documentation for
Sort
View
SIGIR
2004
ACM
14 years 1 months ago
A search engine for imaged documents in PDF files
Large quantities of documents in the Internet and digital libraries are simply scanned and archived in image format, many of which are packed in PDF files. The word search tool pr...
Yue Lu, Li Zhang, Chew Lim Tan
IEAAIE
2004
Springer
14 years 1 months ago
Incremental Induction of Classification Rules for Cultural Heritage Documents
This work presents the application of a first-order logic incremental learning system, INTHELEX, to learn rules for the automatic identification of a wide range of significant docu...
Teresa Maria Altomare Basile, Stefano Ferilli, Nic...
SPIRE
2004
Springer
14 years 1 months ago
Indexing Text Documents Based on Topic Identification
This work provides algorithms and heuristics to index text documents by determining important topics in the documents. To index text documents, the work provides algorithms to gene...
Manonton Butarbutar, Susan McRoy
ICDAR
2003
IEEE
14 years 1 months ago
Word Searching in CCITT Group 4 Compressed Document Images
In this paper, we present a compressed pattern matching method for searching user queried words in the CCITT Group 4 compressed document images, without decompressing. The feature...
Yue Lu, Chew Lim Tan
ICDAR
2003
IEEE
14 years 1 months ago
Comparison of Some Thresholding Algorithms for Text/Background Segmentation in Difficult Document Images
A number of techniques have previously been proposed for effective thresholding of document images. In this paper two new thresholding techniques are proposed and compared against...
Graham Leedham, Yan Chen, Kalyan Takru, Joie Hadi ...