Sciweavers

265 search results - page 13 / 53
» Statistical-Based Approach to Word Segmentation
ICDAR
2011
IEEE
12 years 7 months ago
Word Retrieval in Historical Document Using Character-Primitives
Word searching and indexing in historical document collections is a challenging problem because characters in these documents are often touching or broken due to degradation/agei...
Partha Pratim Roy, Jean-Yves Ramel, Nicolas Ragot
ICDAR
2011
IEEE
12 years 7 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...
ACL
2010
13 years 5 months ago
Automatic Sanskrit Segmentizer Using Finite State Transducers
In this paper, we propose a novel method for automatic segmentation of a Sanskrit string into different words. The input for our segmentizer is a Sanskrit string either encoded as...
Vipul Mittal
IJDAR
2011
12 years 11 months ago
ICDAR2009 handwriting segmentation contest
The Handwriting Segmentation Contest was organized in the context of ICDAR2009 conference in order to record recent advances in off-line handwriting segmentation. This paper descr...
Basilios Gatos, Nikolaos Stamatopoulos, Georgios L...
NAACL
2010
13 years 5 months ago
Automatic Diacritization for Low-Resource Languages Using a Hybrid Word and Consonant CMM
We are interested in diacritizing Semitic languages, especially Syriac, using only diacritized texts. Previous methods have required the use of tools such as part-of-speech tagger...
Robbie Haertel, Peter McClanahan, Eric K. Ringger