Sciweavers

163 search results - page 19 / 33
» Use of Lexicon Density in Evaluating Word Recognizers
ANLP
1994
13 years 9 months ago
Tagging and Morphological Disambiguation of Turkish Text
Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In language...
Kemal Oflazer, Ilker Kuruöz
EMNLP
2007
13 years 9 months ago
Extending a Thesaurus in the Pan-Chinese Context
In this paper, we address a unique problem in Chinese language processing and report on our study on extending a Chinese thesaurus with region-specific words, mostly from the fina...
Oi Yee Kwong, Benjamin Ka-Yin T'sou
IJDAR
2011
13 years 2 months ago
SCUT-COUCH2009 - a comprehensive online unconstrained Chinese handwriting database and benchmark evaluation
A comprehensive online unconstrained Chinese handwriting dataset, SCUT-COUCH2009, is introduced in this paper. As a revision of SCUT-COUCH2008 [1], the SCUT-COUCH2009 database co...
Lianwen Jin, Yan Gao, Gang Liu, Yunyang Li, Kai Di...
CICLING
2003
Springer
14 years 29 days ago
Experiments with Linguistic Categories for Language Model Optimization
In this work we obtain robust category-based language models to be integrated into speech recognition systems. Deductive rules are used to select linguistic categories and to matc...
Arantza Casillas, Amparo Varona, Inés Torre...
NAACL
2003
13 years 9 months ago
A Generative Probabilistic OCR Model for NLP Applications
In this paper, we introduce a generative probabilistic optical character recognition (OCR) model that describes an end-to-end process in the noisy channel framework, progressing f...
Okan Kolak, William J. Byrne, Philip Resnik