Sciweavers

304 search results - page 8 / 61
» A Semi-Supervised Document Clustering Technique for Informat...
Sort
View
DIAL
2006
IEEE
167views Image Analysis» more  DIAL 2006»
14 years 1 months ago
Tree clustering for layout-based document image retrieval
We describe a system for the retrieval on the basis of layout similarity of document images belonging to collections stored in digital libraries. Layout regions are extracted and ...
Simone Marinai, Emanuele Marino, Giovanni Soda
ECIR
2007
Springer
13 years 9 months ago
A Hierarchical Consensus Architecture for Robust Document Clustering
Abstract. A major problem encountered by text clustering practitioners is the difficulty of determining a priori which is the optimal text representation and clustering technique f...
Xavier Sevillano, Germán Cobo, Francesc Al&...
ICPR
2008
IEEE
14 years 1 months ago
A robust technique for text extraction in mixed-type binary documents
A crucial preprocessing stage in applications such as OCR is text extraction from mixed-type documents. The present work, in contrast to most until now, successfully faces the pro...
Charalambos Strouthopoulos, Athanasios Nikolaidis
CIDM
2007
IEEE
14 years 1 months ago
Distributed Document Clustering Using Word-clusters
−Document clustering has become an increasingly important task in analyzing huge numbers of documents distributed among various sites. The challenging aspect is to analyze this e...
Debzani Deb, Rafal A. Angryk
INEX
2005
Springer
14 years 1 months ago
Clustering XML Documents Using Self-organizing Maps for Structures
Self-Organizing Maps capable of encoding structured information will be used for the clustering of XML documents. Documents formatted in XML are appropriately represented as graph ...
Markus Hagenbuchner, Alessandro Sperduti, Ah Chung...