Sciweavers

689 search results - page 24 / 138
» Urdu Word Segmentation
ACL
2003
13 years 8 months ago
Morphological Analysis of a Large Spontaneous Speech Corpus in Japanese
This paper describes two methods for detecting word segments and their morphological information in a Japanese spontaneous speech corpus, and describes how to tag a large spontane...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
LREC
2008
13 years 8 months ago
LC-STAR II: Starring more Lexica
LC-STAR II is a follow-up project of the EU funded project LC-STAR (Lexica and Corpora for Speech-to-Speech Translation Components, IST-2001-32216). LC-STAR II develops large lexi...
Ute Ziegenhain, Hanne Fersoe, Henk van den Heuvel,...
ICDAR
2011
IEEE
12 years 7 months ago
A Method for Removing Inflectional Suffixes in Word Spotting of Mongolian Kanjur
Abstract. According to the characteristics of Mongolian word formation, a method for removing inflectional suffixes from word images of the Mongolian Kanjur is proposed in this paper. ...
Hongxi Wei, Guanglai Gao, Yulai Bao
IJCNLP
2005
Springer
14 years 26 days ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domain-dependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
COLING
2000
13 years 8 months ago
Automatic Corpus-Based Thai Word Extraction with the C4.5 Learning Algorithm
"Word" is difficult to define in the languages that do not exhibit explicit word boundary, such as Thai. Traditional methods on defining words for this kind of languages...
Virach Sornlertlamvanich, Tanapong Potipiti, Thats...