Sciweavers

222 search results - page 22 / 45
» Ancient document analysis based on text line extraction
Sort
View
ACL
1992
13 years 9 months ago
SEXTANT: Exploring Unexplored Contexts for Semantic Extraction from Syntactic Analysis
For a very long time, it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists ...
Gregory Grefenstette
ICDAR
2009
IEEE
14 years 2 months ago
The GERMANA Database
A new handwritten text database, GERMANA, is presented to facilitate empirical comparison of different approaches to text line extraction and off-line handwriting recognition. G...
Daniel Pérez, Lionel Tarazón, Nicol&...
ICDAR
1997
IEEE
14 years 7 hour ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
ICDAR
2009
IEEE
13 years 5 months ago
Analysis of Book Documents' Table of Content Based on Clustering
Table of contents (TOC) recognition has attracted a great deal of attention in recent years. After reviewing the merits and drawbacks of the existing TOC recognition methods, we h...
Liangcai Gao, Zhi Tang, Xiaofan Lin, Xin Tao, Yimi...
ICPR
2000
IEEE
14 years 6 days ago
Automatic Ground-Truth Generation for Skew-Tolerance Evaluation of Document Layout Analysis Methods
Generation of ground-truths is of great importance for unbiased performance evaluation of document layout analysis methods. This is especially necessary because many methods are c...
Oleg Okun, Matti Pietikäinen