Sciweavers

689 search results - page 14 / 138
» Urdu Word Segmentation
Sort
View
NLPRS
2001
Springer
13 years 11 months ago
A Hierarchical EM Approach to Word Segmentation
We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...
Fuchun Peng, Dale Schuurmans
ICDAR
2003
IEEE
14 years 18 days ago
Word Segmentation of Handwritten Dates in Historical Documents by Combining Semantic A-Priori-Knowledge with Local Features
The recognition of script in historical documents requires suitable techniques in order to identify single words. Segmentation of lines and words is a challenging task because lin...
Markus Feldbach, Klaus D. Tönnies
IDA
2001
Springer
13 years 11 months ago
Self-Supervised Chinese Word Segmentation
Abstract. We propose a new unsupervised training method for acquiring probability models that accurately segment Chinese character sequences into words. By constructing a core lexi...
Fuchun Peng, Dale Schuurmans
ICPR
2000
IEEE
13 years 11 months ago
Statistical-Based Approach to Word Segmentation
Thispaper presents a text word extraction algorithm that takes a set of bounding boxes of glyphs and their associated text lines of a given document andpartitions the glyphs into ...
Yalin Wang, Robert M. Haralick, Ihsin T. Phillips
CIKM
1999
Springer
13 years 11 months ago
Word Segmentation and Recognition for Web Document Framework
It is observed that a better approach to Web information understanding is to base on its document framework, which is mainly consisted of (i) the title and the URL name of the pag...
Chi-Hung Chi, Chen Ding, Andrew Lim