Sciweavers

AUSDM
2006
Springer

Integrated Scoring For Spelling Error Correction, Abbreviation Expansion and Case Restoration in Dirty Text

14 years 4 months ago
Integrated Scoring For Spelling Error Correction, Abbreviation Expansion and Case Restoration in Dirty Text
An increasing number of language and speech applications are gearing towards the use of texts from online sources as input. Despite such rise, not much work can be found in the aspect of integrated approaches for cleaning dirty texts from online sources. This paper presents a mechanism of Integrated Scoring for Spelling error correction, Abbreviation expansion and Case restoration (ISSAC). The idea of ISSAC was first conceived as part of the text preprocessing phase in an ontology engineering project. Evaluations of ISSAC using 400 chat records reveal an improved accuracy of 96.5% over the existing 74.4% based on the use of Aspell only.
Wilson Wong, Wei Liu, Mohammed Bennamoun
Added 20 Aug 2010
Updated 20 Aug 2010
Type Conference
Year 2006
Where AUSDM
Authors Wilson Wong, Wei Liu, Mohammed Bennamoun
Comments (0)