Sciweavers

684 search results - page 28 / 137
» Vietnamese Word Segmentation
Sort
View
ACL
2003
13 years 10 months ago
Morphological Analysis of a Large Spontaneous Speech Corpus in Japanese
This paper describes two methods for detecting word segments and their morphological information in a Japanese spontaneous speech corpus, and describes how to tag a large spontane...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
IJCNLP
2005
Springer
14 years 2 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domaindependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
COLING
2000
13 years 10 months ago
Automatic Corpus-Based Thai Word Extraction with the C4.5 Learning Algorithm
"Word" is difficult to define in the languages that do not exhibit explicit word boundary, such as Thai. Traditional methods on defining words for this kind of languages...
Virach Sornlertlamvanich, Tanapong Potipiti, Thats...
ICDAR
1995
IEEE
14 years 6 days ago
Handwritten word recognition for real-time applications
—A fast method of handwritten word recognition suitable for real time applications is presented in this paper. Preprocessing, segmentation and feature extraction are implemented ...
Gyeonghwan Kim, Venu Govindaraju
ICIP
2001
IEEE
14 years 10 months ago
Word shape recognition for image-based document retrieval
In this paper, we propose a word shape recognition method for retrieving image-based documents. Document images are segmented at the word level first. Then the proposed method det...
Weihua Huang, Chew Lim Tan, Sam Yuan Sung, Yi Xu