Sciweavers

2827 search results - page 152 / 566
» Marking Text Documents
Sort
View
CICLING
2007
Springer
14 years 2 months ago
Clustering Narrow-Domain Short Texts by Using the Kullback-Leibler Distance
Clustering short length texts is a difficult task itself, but adding the narrow domain characteristic poses an additional challenge for current clustering methods. We addressed thi...
David Pinto, José-Miguel Benedí, Pao...
ICDAR
2003
IEEE
14 years 1 months ago
Video text recognition using feature compensation as category-dependent feature extraction
When recognizing multiple fonts, geometric features, such as the directional information of strokes, are generally robust against deformation but are weak against degradation. Thi...
Minoru Mori
ICDAR
2003
IEEE
14 years 1 months ago
Learning the lexicon from raw texts for open-vocabulary Korean word recognition
In this paper, we propose a novel method of building a language model for open-vocabulary Korean word recognition. Due to the complex morphology of Korean, it is inappropriate to ...
Sungho Ryu, Jin Hyung Kim
DMKD
2000
ACM
110views Data Mining» more  DMKD 2000»
14 years 19 days ago
Combining Strategies for Extracting Relations from Text Collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Eleazar Eskin, Luis Gravano
AMTA
1998
Springer
14 years 15 days ago
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
Abstract. Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genreand domain-speci city, licensing restri...
Philip Resnik