Search Sciweavers | Sciweavers

176

NLPRS
2001
Springer

94views Natural Language Processing» more NLPRS 2001»

A Hierarchical EM Approach to Word Segmentation

15 years 11 months ago

We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...

Fuchun Peng, Dale Schuurmans

claim paper

Read More »

237

click to vote

ICDAR
2003
IEEE

113views Document Analysis» more ICDAR 2003»

Word Segmentation of Handwritten Dates in Historical Documents by Combining Semantic A-Priori-Knowledge with Local Features

16 years 18 days ago

Download www.cse.salford.ac.uk

The recognition of script in historical documents requires suitable techniques in order to identify single words. Segmentation of lines and words is a challenging task because lin...

Markus Feldbach, Klaus D. Tönnies

claim paper

Read More »

194

click to vote

IDA
2001
Springer

93views Information Technology» more IDA 2001»

Self-Supervised Chinese Word Segmentation

15 years 11 months ago

Download ai.uwaterloo.ca

Abstract. We propose a new unsupervised training method for acquiring probability models that accurately segment Chinese character sequences into words. By constructing a core lexi...

Fuchun Peng, Dale Schuurmans

claim paper

Read More »

176

click to vote

ICPR
2000
IEEE

190views computer vision» more ICPR 2000»

Statistical-Based Approach to Word Segmentation

15 years 11 months ago

Download www.math.ucla.edu

Thispaper presents a text word extraction algorithm that takes a set of bounding boxes of glyphs and their associated text lines of a given document andpartitions the glyphs into ...

Yalin Wang, Robert M. Haralick, Ihsin T. Phillips

claim paper

Read More »

222

click to vote

CIKM
1999
Springer

124views Information Technology» more CIKM 1999»

Word Segmentation and Recognition for Web Document Framework

15 years 11 months ago

Download www.scs.ryerson.ca

It is observed that a better approach to Web information understanding is to base on its document framework, which is mainly consisted of (i) the title and the URL name of the pag...

Chi-Hung Chi, Chen Ding, Andrew Lim

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers