word segmentation | Sciweavers

40

EMNLP
2010

153views Natural Language Processing» more EMNLP 2010»

Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping

13 years 10 months ago

Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...

Baobao Chang, Dongxu Han

claim paper

Read More »

33

click to vote

CORR
2002
Springer

90views Education» more CORR 2002»

Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences

14 years 7 days ago

Download www.cs.cornell.edu

Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...

Rie Kubota Ando, Lillian Lee

claim paper

Read More »

33

click to vote

COLING
2002

96views Computational Linguistics» more COLING 2002»

Investigating the Relationship between Word Segmentation Performance and Retrieval Performance in Chinese IR

14 years 8 days ago

Download acl.ldc.upenn.edu

It is commonly believed that word segmentation accuracy is monotonically related to retrieval performance in Chinese information retrieval. In this paper we show that, for Chinese...

Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick ...

claim paper

Read More »

34

click to vote

COLING
1996

160views Computational Linguistics» more COLING 1996»

The Automatic Extraction of Open Compounds from Text Corpora

14 years 1 months ago

Download acl.ldc.upenn.edu

This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korea that exhibit un...

Virach Sornlertlamvanich, Hozumi Tanaka

claim paper

Read More »

28

click to vote

COLING
1994

92views Computational Linguistics» more COLING 1994»

An IBM-PC Environment For Chinese Corpus Analysis

14 years 1 months ago

Download acl.ldc.upenn.edu

This paper describes a set of computer programs for Chinese corpus analysis. These programs include (1) extraction of different characters, bigrams and words; (2) word segmentatio...

Robert Wing Pong Luk

claim paper

Read More »

22

click to vote

ACL
1997

120views Computational Linguistics» more ACL 1997»

A Trainable Rule-based Algorithm for Word Segmentation

14 years 1 months ago

Download www.aclweb.org

This paper presents a trainable rule-based algorithm for performing word segmentation. The algorithm provides a simple, language-independent alternative to large-scale lexicai-bas...

David D. Palmer

claim paper

Read More »

31

click to vote

COLING
2000

151views Computational Linguistics» more COLING 2000»

Automatic Corpus-Based Thai Word Extraction with the C4.5 Learning Algorithm

14 years 1 months ago

Download acl.ldc.upenn.edu

"Word" is difficult to define in the languages that do not exhibit explicit word boundary, such as Thai. Traditional methods on defining words for this kind of languages...

Virach Sornlertlamvanich, Tanapong Potipiti, Thats...

claim paper

Read More »

38

click to vote

EMNLP
2004

168views Natural Language Processing» more EMNLP 2004»

Chinese Part-of-Speech Tagging: One-at-a-Time or All-at-Once? Word-Based or Character-Based?

14 years 1 months ago

Download www.aclweb.org

Chinese part-of-speech (POS) tagging assigns one POS tag to each word in a Chinese sentence. However, since words are not demarcated in a Chinese sentence, Chinese POS tagging req...

Hwee Tou Ng, Jin Kiat Low

claim paper

Read More »

36

click to vote

ACL
2006

125views Computational Linguistics» more ACL 2006»

Contextual Dependencies in Unsupervised Word Segmentation

14 years 1 months ago

Download cocosci.berkeley.edu

Developing better methods for segmenting continuous text into words is important for improving the processing of Asian languages, and may shed light on how humans learn to segment...

Sharon Goldwater, Thomas L. Griffiths, Mark Johnso...

claim paper

Read More »

32

click to vote

ACL
2004

89views Computational Linguistics» more ACL 2004»

Adaptive Chinese Word Segmentation

14 years 1 months ago

Download acl.ldc.upenn.edu

This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are...

Jianfeng Gao, Andi Wu, Cheng-Ning Huang, Hongqiao ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers