Chinese word segmentation

188

COLING
2010

120views Computational Linguistics» more COLING 2010»

Word-based and Character-based Word Segmentation Models: Comparison and Combination

15 years 1 months ago

We present a theoretical and empirical comparative analysis of the two dominant categories of approaches in Chinese word segmentation: word-based models and character-based models...

Weiwei Sun

claim paper

Read More »

202

click to vote

ACL
2009

185views Computational Linguistics» more ACL 2009»

An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging

15 years 4 months ago

Download www.aclweb.org

In this paper, we present a discriminative word-character hybrid model for joint Chinese word segmentation and POS tagging. Our word-character hybrid model offers high performance...

Canasai Kruengkrai, Kiyotaka Uchimoto, Jun'ichi Ka...

claim paper

Read More »

190

click to vote

ACL
2009

147views Computational Linguistics» more ACL 2009»

Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging - A Case Study

15 years 4 months ago

Download mtgroup.ict.ac.cn

Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence labeling there exist multiple corpora with different a...

Wenbin Jiang, Liang Huang, Qun Liu

claim paper

Read More »

212

click to vote

ACL
2003

127views Computational Linguistics» more ACL 2003»

Improved Source-Channel Models for Chinese Word Segmentation

15 years 8 months ago

Download www.aclweb.org

This paper presents a Chinese word segmentation system that uses improved sourcechannel models of Chinese sentence generation. Chinese words are defined as one of the following fo...

Jianfeng Gao, Mu Li, Changning Huang

claim paper

Read More »

190

click to vote

ACL
2006

149views Computational Linguistics» more ACL 2006»

Subword-Based Tagging for Confidence-Dependent Chinese Word Segmentation

15 years 8 months ago

Download acl.ldc.upenn.edu

We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum entr...

Ruiqiang Zhang, Gen-ichiro Kikui, Eiichiro Sumita

claim paper

Read More »

184

click to vote

ACL
2006

103views Computational Linguistics» more ACL 2006»

Discriminative Pruning of Language Models for Chinese Word Segmentation

15 years 8 months ago

Download acl.ldc.upenn.edu

This paper presents a discriminative pruning method of n-gram language model for Chinese word segmentation. To reduce the size of the language model that is used in a Chinese word...

Jianfeng Li, Haifeng Wang, Dengjun Ren, Guohua Li

claim paper

Read More »

167

click to vote

ACL
2004

89views Computational Linguistics» more ACL 2004»

Adaptive Chinese Word Segmentation

15 years 8 months ago

Download acl.ldc.upenn.edu

This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are...

Jianfeng Gao, Andi Wu, Cheng-Ning Huang, Hongqiao ...

claim paper

Read More »

180

click to vote

LREC
2010

195views Education» more LREC 2010»

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

15 years 8 months ago

Download www.lrec-conf.org

In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western languages. Therefore Chinese word segmentation is considered an ...

Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Can...

claim paper

Read More »

209

click to vote

LREC
2010

188views Education» more LREC 2010»

How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method

15 years 8 months ago

Download www.lrec-conf.org

We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...

Hai Zhao, Yan Song, Chunyu Kit

claim paper

Read More »

185

click to vote

ACL
2007

153views Computational Linguistics» more ACL 2007»

Chinese Segmentation with a Word-Based Perceptron Algorithm

15 years 8 months ago

Download www.cl.cam.ac.uk

Standard approaches to Chinese word segmentation treat the problem as a tagging task, assigning labels to the characters in the sequence indicating whether the character marks a w...

Yue Zhang 0004, Stephen Clark

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers