Sciweavers

3 search results - page 1 / 1
» Mostly-Unsupervised Statistical Segmentation of Japanese Kan...
Sort
View
CORR
2002
Springer
90views Education» more  CORR 2002»
13 years 7 months ago
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...
Rie Kubota Ando, Lillian Lee
COLING
1996
13 years 8 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korea that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka