ANLP
2000

Categorizing Unknown Words: Using Decision Trees to Identify Names and Misspellings

This paper introduces a system for categorizing unknown words. The system is based on a multicomponent architecture in which each component is responsible for identifying one class of unknown words. The paper focuses on the components that identify names and spelling errors. Each component uses a decision tree to combine multiple types of evidence about the unknown word. The system is evaluated on data from live closed captions, a genre replete with a wide variety of unknown words.
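The multicomponent design described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy lexicon, the three features (capitalization, minimum edit distance to a known word, word length), the hand-written decision rules, and the names `features`, `is_misspelling`, `is_name`, and `categorize` are all hypothetical. In practice the trees would be learned from labeled data rather than written by hand.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy lexicon of known words (not from the paper).
LEXICON = {"the", "weather", "report", "president", "tonight"}


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling dynamic-programming row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def features(word: str) -> Dict[str, object]:
    """Collect several kinds of evidence about one unknown word (illustrative)."""
    return {
        "capitalized": word[:1].isupper(),
        "min_edit_distance": min(edit_distance(word.lower(), w) for w in LEXICON),
        "length": len(word),
    }


def is_misspelling(f: Dict[str, object]) -> bool:
    """Hand-written stand-in for a decision tree: near a known word and not too short."""
    if f["min_edit_distance"] <= 1:
        return f["length"] >= 4
    return False


def is_name(f: Dict[str, object]) -> bool:
    """Hand-written stand-in for a decision tree: capitalized and far from known words."""
    if f["capitalized"]:
        return f["min_edit_distance"] > 1
    return False


# One component per class of unknown word, consulted in order (assumed ordering).
COMPONENTS: List[Tuple[str, Callable[[Dict[str, object]], bool]]] = [
    ("misspelling", is_misspelling),
    ("name", is_name),
]


def categorize(word: str) -> str:
    """Return the label of the first component that claims the word, else 'other'."""
    f = features(word)
    for label, component in COMPONENTS:
        if component(f):
            return label
    return "other"
```

Under these assumed rules, `categorize("weathr")` yields `"misspelling"` and `categorize("Toole")` yields `"name"`. Capitalization is used here only as a placeholder feature; it may carry little signal in closed-caption text.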
Janine Toole
Type Conference
Year 2000
Where ANLP
Authors Janine Toole