Sciweavers

843 search results - page 102 / 169
» Segmentation of Compressed Documents
Sort
View
ICAPR
2001
Springer
14 years 2 months ago
Character Extraction from Interfering Background - Analysis of Double-Sided Handwritten Archival Documents
The sipping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem to human readers or OCR systems. This pape...
Chew Lim Tan, Ruini Cao, Qian Wang, Peiyi Shen
DAS
2006
Springer
14 years 1 months ago
Language Identification in Degraded and Distorted Document Images
This paper presents a language identification technique that differentiates Latin-based languages in degraded and distorted document images. Different from the reported methods tha...
Shijian Lu, Chew Lim Tan, Weihua Huang
LREC
2008
141views Education» more  LREC 2008»
13 years 11 months ago
New Resources for Document Classification, Analysis and Translation Technologies
The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
Stephanie Strassel, Lauren Friedman, Safa Ismael, ...
CVPR
2006
IEEE
15 years 3 days ago
Searching Off-line Arabic Documents
Currently an abundance of historical manuscripts, journals, and scientific notes remain largely unaccessible in library archives. Manual transcription and publication of such docu...
Jim Chan, Celal Ziftci, David A. Forsyth
COLING
2000
13 years 11 months ago
The effects of analysing cohesion on document summarisation
We argue that in general, the analysis of lexical cohesion factors in a document can drive a summarizer, as well as enable other content characterization tasks. More narrowly, thi...
Branimir Boguraev, Mary S. Neff