Sciweavers

111 search results - page 5 / 23
» Word Segmentation of Vietnamese Texts: a Comparison of Appro...
Sort
View
ICDAR
2011
IEEE
12 years 6 months ago
Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach
—Libraries in South Asia hold huge collections of valuable printed documents in Urdu and it is of interest to digitize these collections to make them more accessible. The unavail...
Ali Abidi, Imran Siddiqi, Khurram Khurshid
ACL
2010
13 years 5 months ago
Automatic Sanskrit Segmentizer Using Finite State Transducers
In this paper, we propose a novel method for automatic segmentation of a Sanskrit string into different words. The input for our segmentizer is a Sanskrit string either encoded as...
Vipul Mittal
COLING
2008
13 years 8 months ago
Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation
Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...
Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...
PKDD
2009
Springer
150views Data Mining» more  PKDD 2009»
14 years 1 months ago
Omiotis: A Thesaurus-Based Measure of Text Relatedness
In this paper we present a new approach for measuring the relatedness between text segments, based on implicit semantic links between their words, as offered by a word thesaurus, n...
George Tsatsaronis, Iraklis Varlamis, Michalis Vaz...
CAIP
2009
Springer
246views Image Analysis» more  CAIP 2009»
13 years 11 months ago
A Novel Approach for Word Spotting Using Merge-Split Edit Distance
Edit distance matching has been used in literature for word spotting with characters taken as primitives. The recognition rate however, is limited by the segmentation inconsistenci...
Khurram Khurshid, Claudie Faure, Nicole Vincent