Sciweavers

144 search results for "Improved Source-Channel Models for Chinese Word Segmentation" (page 6 of 29)
JCIT
2010
13 years 2 months ago
The Recognition Method of Unknown Chinese Words in Fragments Based on Mutual Information
This paper presents a method that uses mutual information to improve the recognition algorithm for unknown Chinese words; it can resolve the complexity of weight settings and the in...
Qian Zhu, Xian-Yi Cheng, Zi-juan Gao
TALIP
2002
13 years 7 months ago
Toward a unified approach to statistical language modeling for Chinese
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...
ACL
2009
13 years 5 months ago
Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling
In this paper, we propose a new Bayesian model for fully unsupervised word segmentation and an efficient blocked Gibbs sampler combined with dynamic programming for inference. Our...
Daichi Mochihashi, Takeshi Yamada, Naonori Ueda
AI
2009
Springer
14 years 2 months ago
Training Global Linear Models for Chinese Word Segmentation
This paper examines how one can obtain state of the art Chinese word segmentation using global linear models. We provide experimental comparisons that give a detailed road-map for ...
Dong Song, Anoop Sarkar
ACL
2004
13 years 9 months ago
Adaptive Chinese Word Segmentation
This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are...
Jianfeng Gao, Andi Wu, Cheng-Ning Huang, Hongqiao ...