Sciweavers

65 search results - page 5 / 13
» Text - Image Separation in Devanagari Documents
Sort
View
ENC
2005
IEEE
14 years 9 days ago
Combining Structural and Textual Contexts for Compressing Semistructured Databases
We describe a compression technique for semistructured documents, called SCMPPM, which combines the Prediction by Partial Matching technique with Structural Contexts Model (SCM) t...
Joaquín Adiego, Pablo de la Fuente, Gonzalo...
DAS
2006
Springer
13 years 10 months ago
Segmentation-Driven Recognition Applied to Numerical Field Extraction from Handwritten Incoming Mail Documents
Abstract. In this paper, we present a method for the automatic extraction of numerical fields (zip codes, phone numbers, etc.) from incoming mail documents. The approach is based o...
Clément Chatelain, Laurent Heutte, Thierry ...
DL
1994
Springer
191views Digital Library» more  DL 1994»
13 years 10 months ago
Corpus Linguistics for Establishing The Natural Language Content of Digital Library Documents
Digital Libraries will hold huge amounts of text and other forms of information. For the collections to be maximally useful, they must be highly organized with useful indexes and ...
Robert P. Futrelle, Xiaolan Zhang 0002, Yumiko Sek...
IDA
2010
Springer
13 years 5 months ago
Selecting the Links in BisoNets Generated from Document Collections
According to Koestler, the notion of a bisociation denotes a connection between pieces of information from habitually separated domains or categories. In this paper, we consider a ...
Marc Segond, Christian Borgelt
AI
2009
Springer
14 years 1 months ago
An Empirical Study of Category Skew on Feature Selection for Text Categorization
In this paper, we present an empirical comparison of the effects of category skew on six feature selection methods. The methods were evaluated on 36 datasets generated from the 20...
Mondelle Simeon, Robert J. Hilderman