Sciweavers

832 search results - page 16 / 167
» Robust Spelling Correction
Sort
View
EACL
2006
ACL Anthology
13 years 9 months ago
Web Text Corpus for Natural Language Processing
Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpu...
Vinci Liu, James R. Curran
ACL
1992
13 years 8 months ago
Lattice-Based Word Identification in CLARE
I argue that because of spelling and typing errors and other properties of typed text, the identification of words and word boundaries in general requires syntactic and semantic k...
David M. Carter
CORR
2008
Springer
94views Education» more  CORR 2008»
13 years 7 months ago
Enhanced Integrated Scoring for Cleaning Dirty Texts
An increasing number of approaches for ontology engineering from text are gearing towards the use of online sources such as company intranet and the World Wide Web. Despite such r...
Wilson Wong, Wei Liu, Mohammed Bennamoun
NAACL
2007
13 years 9 months ago
High-Performance, Language-Independent Morphological Segmentation
This paper introduces an unsupervised morphological segmentation algorithm that shows robust performance for four languages with different levels of morphological complexity. In p...
Sajib Dasgupta, Vincent Ng
EMNLP
2010
13 years 5 months ago
Word-Based Dialect Identification with Georeferenced Rules
We present a novel approach for (written) dialect identification based on the discriminative potential of entire words. We generate Swiss German dialect words from a Standard Germ...
Yves Scherrer, Owen Rambow