Sciweavers

144 search results - page 7 / 29
» Improved Source-Channel Models for Chinese Word Segmentation
Sort
View
IDA
2001
Springer
14 years 10 days ago
Self-Supervised Chinese Word Segmentation
Abstract. We propose a new unsupervised training method for acquiring probability models that accurately segment Chinese character sequences into words. By constructing a core lexi...
Fuchun Peng, Dale Schuurmans
ACL
1997
13 years 9 months ago
A Trainable Rule-based Algorithm for Word Segmentation
This paper presents a trainable rule-based algorithm for performing word segmentation. The algorithm provides a simple, language-independent alternative to large-scale lexicai-bas...
David D. Palmer
ACL
2006
13 years 9 months ago
Subword-Based Tagging for Confidence-Dependent Chinese Word Segmentation
We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum entr...
Ruiqiang Zhang, Gen-ichiro Kikui, Eiichiro Sumita
IJCNLP
2004
Springer
14 years 1 months ago
The Use of SVM for Chinese New Word Identification
We present a study of new word identification (NWI) to improve the performance of a Chinese word segmenter. In this paper the distribution and types of new words are discussed emp...
Hongqiao Li, Changning Huang, Jianfeng Gao, Xiaozh...