Sciweavers

» Document clustering based on cluster validation
DAS
2006
Springer
13 years 11 months ago
Efficient Word Retrieval by Means of SOM Clustering and PCA
We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self Orga...
Simone Marinai, Stefano Faini, Emanuele Marino, Gi...
DIAL
2006
IEEE
14 years 1 months ago
Tree clustering for layout-based document image retrieval
We describe a system for the retrieval on the basis of layout similarity of document images belonging to collections stored in digital libraries. Layout regions are extracted and ...
Simone Marinai, Emanuele Marino, Giovanni Soda
AIRS
2004
Springer
14 years 1 months ago
Document Clustering Using Linear Partitioning Hyperplanes and Reallocation
This paper presents a novel algorithm for document clustering based on a combinatorial framework of the Principal Direction Divisive Partitioning (PDDP) algorithm [1] and a simpli...
Canasai Kruengkrai, Virach Sornlertlamvanich, Hito...
ECIR
2006
Springer
13 years 9 months ago
Clustering-Based Searching and Navigation in an Online News Source
The growing amount of online news posted on the WWW demands new algorithms that support topic detection, search, and navigation of news documents. This work presents an algorithm f...
Simón C. Smith, M. Andrea Rodríguez
SIGIR
2008
ACM
13 years 7 months ago
Knowledge transformation from word space to document space
In most IR clustering problems, we directly cluster the documents, working in the document space, using cosine similarity between documents as the similarity measure. In many real...
Tao Li, Chris H. Q. Ding, Yi Zhang 0005, Bo Shao