Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

196

ACL
2010

130views Computational Linguistics» more ACL 2010»

Conditional Random Fields for Word Hyphenation

15 years 5 months ago

Conditional Random Fields for Word Hyphenation

Download www.aclweb.org

Finding allowable places in words to insert hyphens is an important practical problem. The algorithm that is used most often nowadays has remained essentially unchanged for 25 years. This method is the TEX hyphenation algorithm of Knuth and Liang. We present here a hyphenation method that is clearly more accurate. The new method is an application of conditional random fields. We create new training sets for English and Dutch from the CELEX European lexical resource, and achieve error rates for English of less than 0.1% for correctly allowed hyphens, and less than 0.01% for Dutch. Experiments show that both the Knuth/Liang method and a leading current commercial alternative have error rates several times higher for both languages.

Nikolaos Trogkanis, Charles Elkan

Real-time Traffic

ACL 2010 | Computational Linguistics | Error Rates | Important Practical Problem | TEX Hyphenation Algorithm |

claim paper

Related Content

» Discriminative duration modeling for speech recognition with segmental conditional random ...

» Word Sense Disambiguation for All Words using TreeStructured Conditional Random Fields

» SCARF a segmental conditional random field toolkit for speech recognition

» A Simple and Efficient Model Pruning Method for Conditional Random Fields

» Handwritten Word Recognition Using Conditional Random Fields

» Discriminative Word Alignment with Conditional Random Fields

» Applying Conditional Random Fields to Japanese Morphological Analysis

» Conditional Topic Random Fields

» Approximate Parameter Learning in Conditional Random Fields An Empirical Investigation

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Nikolaos Trogkanis, Charles Elkan

Comments (0)