Sciweavers

1125 search results - page 29 / 225
» A flocking based algorithm for document clustering analysis
Sort
View
ICCS
2009
Springer
14 years 4 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
CIKM
2008
Springer
13 years 12 months ago
A language for manipulating clustered web documents results
We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is ...
Gloria Bordogna, Alessandro Campi, Giuseppe Psaila...
ICDAR
2011
IEEE
12 years 9 months ago
Ternary Entropy-Based Binarization of Degraded Document Images Using Morphological Operators
—A vast number of historical and badly degraded document images can be found in libraries, public, and national archives. Due to the complex nature of different artifacts, such p...
T. Hoang Ngan Le, Tien D. Bui, Ching Y. Suen
ICPR
2008
IEEE
14 years 4 months ago
Ancient document analysis based on text line extraction
In order to preserve our cultural heritage and for automated document processing libraries and national archives have started digitizing historical documents. In the case of degra...
Florian Kleber, Robert Sablatnig, Melanie Gau, Hei...
PRL
2006
77views more  PRL 2006»
13 years 10 months ago
Wavelet based approach to cluster analysis. Application on low dimensional data sets
In this paper, we present a wavelet based approach which tries to automatically find the number of clusters present in a data set, along with their position and statistical proper...
Xavier Otazu, Oriol Pujol