Sciweavers

180 search results - page 23 / 36
» Document Page Segmentation Using Multiscale Clustering
Sort
View
ICPR
2008
IEEE
14 years 5 months ago
Ancient document analysis based on text line extraction
In order to preserve our cultural heritage and for automated document processing libraries and national archives have started digitizing historical documents. In the case of degra...
Florian Kleber, Robert Sablatnig, Melanie Gau, Hei...
ICAPR
2001
Springer
14 years 3 months ago
Character Extraction from Interfering Background - Analysis of Double-Sided Handwritten Archival Documents
The sipping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem to human readers or OCR systems. This pape...
Chew Lim Tan, Ruini Cao, Qian Wang, Peiyi Shen
ICDAR
1995
IEEE
14 years 2 months ago
A Hough based algorithm for extracting text lines in handwritten documents
The method herein proposed detects text lines on handwritten pages which may include either lines oriented in several directions, erasures, or annotationsbetween main lines. The m...
Laurence Likforman-Sulem, Anahid Hanimyan, Claudie...
ICPR
2004
IEEE
15 years 14 hour ago
Serialized Unsupervised Classifier for Adaptative Color Image Segmentation: Application to Digitized Ancient Manuscripts
This paper presents an adaptative algorithm for the segmentation of color images suited for document image analysis. The algorithm is based on a serialization of the k-means algor...
Frank Le Bourgeois, Hubert Emptoz, Yann Leydier
WWW
2008
ACM
14 years 11 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev