Lattice-Based Word Identification in CLARE

15 years 3 months ago

Download www.aclweb.org

I argue that because of spelling and typing errors and other properties of typed text, the identification of words and word boundaries in general requires syntactic and semantic knowledge. A lattice representation is therefore appropriate for lexical analysis. I show how the use of such a representation in the CLARE system allows different kinds of hypothesis about word identity to be integrated in a uniform framework. I then describe a quantitative evaluation of CLARE's performance on a set of sentences into which typographic errors have been introduced. The results show that syntax and semantics can be applied as powerful sources of constraint on the possible corrections for misspelled words.

David M. Carter

Real-time Traffic

ACL 1992 | ACL 2007 | General Requires Syntactic | Lattice Representation | Typing Errors |

claim paper

Post Info
More Details (n/a)

Added	06 Nov 2010
Updated	06 Nov 2010
Type	Conference
Year	1992
Where	ACL
Authors	David M. Carter

Comments (0)

Sciweavers

Lattice-Based Word Identification in CLARE

ACL 1992 | ACL 2007 | General Requires Syntactic | Lattice Representation | Typing Errors |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers