word segmentation | Sciweavers

LREC
2010

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western languages. Therefore Chinese word segmentation is considered an ...

Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Can...

LREC
2010

Arabic Part of Speech Tagging

Arabic is a morphologically rich language, which presents a challenge for part of speech tagging. In this paper, we compare two novel methods for POS tagging of Arabic without the...

Emad Mohamed, Sandra Kübler

COLING
2008

Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation

Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...

Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...

ACL
2008

Joint Word Segmentation and POS Tagging Using a Single Perceptron

For Chinese POS tagging, word segmentation is a preliminary step. To avoid error propagation and improve segmentation by utilizing POS information, segmentation and tagging can be...

Yue Zhang, Stephen Clark

ACL
2007

A Hybrid Approach to Word Segmentation and POS Tagging

In this paper, we present a hybrid method for word segmentation and POS tagging. The target languages are those in which word boundaries are ambiguous, such as Chinese and Japanes...

Tetsuji Nakagawa, Kiyotaka Uchimoto

FLAIRS
2007

Combining Machine Learning with Linguistic Heuristics for Chinese Word Segmentation

This paper describes a hybrid model that combines machine learning with linguistic heuristics for integrating unknown word identification with Chinese word segmentation. The model...

Xiaofei Lu

NLPRS
2001
Springer

Vietnamese Word Segmentation

Word segmentation is the first and obligatory task for every NLP. For inflectional languages like English, French, Dutch, etc., their word boundaries are simply assumed to be whitespa...

Dinh Dien, Hoang Kiem, Nguyen Van Toan

TSD
2005
Springer

Modelling Lexical Stress

Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propo...

Rogier C. van Dalen, Pascal Wiggers, Léon J...

IJCNLP
2005
Springer

A Lexicon-Constrained Character Model for Chinese Morphological Analysis

This paper proposes a lexicon-constrained character model that combines both word and character features to solve complicated issues in Chinese morphological analysis. A ...

Yao Meng, Hao Yu, Fumihito Nishino

ICDAR
2007
IEEE

An Efficient Word Segmentation Technique for Historical and Degraded Machine-Printed Documents

Word segmentation is a crucial step for segmentation-free document analysis systems and is used for creating an index based on word matching. In this paper, we propose a novel met...

Michael Makridis, N. Nikolaou, Basilios Gatos
