Sciweavers

220 search results - page 24 / 44
» Language Independent Text Categorization
Sort
View
LREC
2008
70views Education» more  LREC 2008»
13 years 9 months ago
Process Model for Composing High-quality Text Corpora
The Teko corpus composing model offers a decentralized, dynamic way of collecting high-quality text corpora for linguistic research. The resulting corpus consists of independent t...
Mikko Lounela
LREC
2008
88views Education» more  LREC 2008»
13 years 9 months ago
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
ECMDAFA
2006
Springer
104views Hardware» more  ECMDAFA 2006»
13 years 11 months ago
The Epsilon Object Language (EOL)
Model-Driven Development requires model management languages and tools for supporting model operations such as editing, consistency checking, and transformation. At the core of the...
Dimitrios S. Kolovos, Richard F. Paige, Fiona Pola...
ICDAR
2005
IEEE
14 years 1 months ago
Text Recognition of Low-resolution Document Images
Cheap and versatile cameras make it possible to easily and quickly capture a wide variety of documents. However, low resolution cameras present a challenge to OCR because it is vi...
Charles E. Jacobs, Patrice Y. Simard, Paul A. Viol...
IR
2006
13 years 7 months ago
Multilingual modeling of cross-lingual spelling variants
Technical term translations are important for cross-lingual information retrieval. In many languages, new technical terms have a common origin rendered with different spelling of ...
Krister Lindén