Sciweavers

65 search results - page 6 / 13
» Text - Image Separation in Devanagari Documents
Sort
View
DOCENG
2007
ACM
13 years 10 months ago
Extracting reusable document components for variable data printing
Variable Data Printing (VDP) has brought new flexibility and dynamism to the printed page. Each printed instance of a specific class of document can now have different degrees of ...
Steven R. Bagley, David F. Brailsford, James A. Ol...
IPM
2007
95views more  IPM 2007»
13 years 6 months ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
SDM
2004
SIAM
174views Data Mining» more  SDM 2004»
13 years 8 months ago
Classifying Documents Without Labels
Automatic classification of documents is an important area of research with many applications in the fields of document searching, forensics and others. Methods to perform classif...
Daniel Barbará, Carlotta Domeniconi, Ning K...
TREC
2004
13 years 8 months ago
Experiments in Terabyte Searching, Genomic Retrieval and Novelty Detection for TREC 2004
: In TREC2004, Dublin City University took part in three tracks, Terabyte (in collaboration with University College Dublin), Genomic and Novelty. In this paper we will discuss each...
Stephen Blott, Fabrice Camous, Paul Ferguson, Geor...
CIKM
2005
Springer
14 years 7 days ago
Generating better concept hierarchies using automatic document classification
This paper presents a hybrid concept hierarchy development technique for web returned documents retrieved by a meta-search engine. The aim of the technique is to separate the init...
Razvan Stefan Bot, Yi-fang Brook Wu, Xin Chen, Qua...