Sciweavers

689 search results - page 4 / 138
» Urdu Word Segmentation
Sort
View
ICDAR
2005
IEEE
14 years 9 days ago
The Neural-based Segmentation of Cursive Words using Enhanced Heuristics
This paper presents an Enhanced Heuristic Segmenter (EHS) and an improved neural-based segmentation technique for segmenting cursive words and validating prospective segmentation ...
Chun Ki Cheng, Michael Blumenstein
COLING
2010
13 years 1 months ago
Nonparametric Word Segmentation for Machine Translation
We present an unsupervised word segmentation model for machine translation. The model uses existing monolingual segmentation techniques and models the joint distribution over sour...
ThuyLinh Nguyen, Stephan Vogel, Noah A. Smith
LREC
2010
195views Education» more  LREC 2010»
13 years 8 months ago
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western languages. Therefore Chinese word segmentation is considered an ...
Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Can...
LREC
2010
170views Education» more  LREC 2010»
13 years 8 months ago
Arabic Word Segmentation for Better Unit of Analysis
The Arabic language has a very rich morphology where a word is composed of zero or more prefixes, a stem and zero or more suffixes. This makes Arabic data sparse compared to other...
Yassine Benajiba, Imed Zitouni
IJCNLP
2005
Springer
14 years 6 days ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms 1 using a maximum...
Guodong Zhou