Search Sciweavers | Sciweavers

31 search results - page 4 / 7

» Enhancing Chinese Word Segmentation Using Unlabeled Data

157

click to vote

TALIP
2002

108views more TALIP 2002»

Toward a unified approach to statistical language modeling for Chinese

15 years 6 months ago

Download research.microsoft.com

This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...

Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...

claim paper

Read More »

178

click to vote

ICPR
2002
IEEE

148views computer vision» more ICPR 2002»

Incorporating Conditional Independence Assumption with Support Vector Machines to Enhance Handwritten Character Segmentation Per

16 years 7 months ago

Download www.icsd.aegean.gr

Learning Bayesian Belief Networks (BBN) from corpora and incorporating the extracted inferring knowledge with a Support Vector Machines (SVM) classifier has been applied to charac...

Manolis Maragoudakis, Ergina Kavallieratou, Nikos ...

claim paper

Read More »

186

click to vote

LREC
2010

188views Education» more LREC 2010»

How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method

15 years 7 months ago

Download www.lrec-conf.org

We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...

Hai Zhao, Yan Song, Chunyu Kit

claim paper

Read More »

184

Voted

KDD
2009
ACM

211views Data Mining» more KDD 2009»

Address standardization with latent semantic association

16 years 6 months ago

Download www-ai.cs.uni-dortmund.de

Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented cooperates...

Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...

claim paper

Read More »

204

click to vote

ACL
2009

167views Computational Linguistics» more ACL 2009»

Mining Bilingual Data from the Web with Adaptively Learnt Patterns

15 years 4 months ago

Download www.aclweb.org

Mining bilingual data (including bilingual sentences and terms1 ) from the Web can benefit many NLP applications, such as machine translation and cross language information retrie...

Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, ...

claim paper

Read More »

« Prev « First page 4 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers