Sciweavers

31 search results - page 2 / 7
» Enhancing Chinese Word Segmentation Using Unlabeled Data
Sort
View
COLING
2008
13 years 8 months ago
Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation
Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...
Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...
IJCNLP
2005
Springer
14 years 26 days ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms 1 using a maximum...
Guodong Zhou
COLING
2002
13 years 7 months ago
Investigating the Relationship between Word Segmentation Performance and Retrieval Performance in Chinese IR
It is commonly believed that word segmentation accuracy is monotonically related to retrieval performance in Chinese information retrieval. In this paper we show that, for Chinese...
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick ...
ACL
2004
13 years 8 months ago
Adaptive Chinese Word Segmentation
This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are...
Jianfeng Gao, Andi Wu, Cheng-Ning Huang, Hongqiao ...
SIGIR
2002
ACM
13 years 7 months ago
Using self-supervised word segmentation in Chinese information retrieval
We propose a self-supervised word-segmentation technique for Chinese information retrieval. This method combines the advantages of traditional dictionary based approaches with cha...
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick ...