Sciweavers

57 search results - page 1 / 12
» Content Characterization Using Word Shape Tokens
Sort
View
CICLING
2007
Springer
14 years 4 months ago
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization
Abstract. Classification in genres and domains is a major field of research for Information Retrieval (scientific and technical watch, datamining, etc.) and the selection of app...
Guillaume Cleuziou, Céline Poudat
ANLP
1994
105views more  ANLP 1994»
13 years 11 months ago
Modeling Content Identification from Document Images
A new technique to locate content-representing words for a given document image using representation of character shapes is described. A character shape code representation define...
Takehiro Nakayama
ANLP
1994
104views more  ANLP 1994»
13 years 11 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
PAKDD
2009
ACM
263views Data Mining» more  PAKDD 2009»
14 years 4 months ago
Spatial Weighting for Bag-of-Visual-Words and Its Application in Content-Based Image Retrieval
It is a challenging and important task to retrieve images from a large and highly varied image data set based on their visual contents. Problems like how to fill the semantic gap b...
Xin Chen, Xiaohua Hu, Xiajiong Shen