Sciweavers

290 search results - page 26 / 58
» Document normalization revisited
Sort
View
ICDAR
2003
IEEE
14 years 1 months ago
Localization, Extraction and Recognition of Text in Telugu Document Images
In this paper we present a system to locate, extract and recognize Telugu text. The circular nature of Telugu script is exploited for segmenting text regions using the Hough Trans...
Atul Negi, K. Nikhil Shanker, Chandra Kanth Chered...
CIMCA
2006
IEEE
13 years 10 months ago
Identification of Document Language is Not yet a Completely Solved Problem
Existing Language Identification (LID) approaches do reach 100% precision, in most common situations, when dealing with documents written in just one language, and when those docu...
Joaquim Ferreira da Silva, Gabriel Pereira Lopes
ICDAR
2009
IEEE
13 years 6 months ago
Combining Alignment Results for Historical Handwritten Document Analysis
In this paper we propose a new strategy for combining the outputs of several alignment systems. Based on the word boundaries retrieved from a number of individual alignment system...
Emanuel Indermühle, Marcus Liwicki, Horst Bun...
AIRS
2004
Springer
14 years 1 months ago
Document Clustering Using Linear Partitioning Hyperplanes and Reallocation
This paper presents a novel algorithm for document clustering based on a combinatorial framework of the Principal Direction Divisive Partitioning (PDDP) algorithm [1] and a simpli...
Canasai Kruengkrai, Virach Sornlertlamvanich, Hito...
SIGIR
1999
ACM
14 years 24 days ago
Summarizing Text Documents: Sentence Selection and Evaluation Metrics
Human-quality text summarization systems are di cult to design, and even more di cult to evaluate, in part because documents can di er along several dimensions, such as length, wri...
Jade Goldstein, Mark Kantrowitz, Vibhu O. Mittal, ...