Sciweavers

1125 search results - page 33 / 225
» A flocking based algorithm for document clustering analysis
Sort
View
ICML
2005
IEEE
14 years 11 months ago
Multi-way distributional clustering via pairwise interactions
We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions between...
Ron Bekkerman, Ran El-Yaniv, Andrew McCallum
CSB
2004
IEEE
136views Bioinformatics» more  CSB 2004»
14 years 1 months ago
Minimum Entropy Clustering and Applications to Gene Expression Analysis
Clustering is a common methodology for analyzing the gene expression data. In this paper, we present a new clustering algorithm from an information-theoretic point of view. First,...
Haifeng Li, Keshu Zhang, Tao Jiang
DOCENG
2009
ACM
14 years 4 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
KDD
2005
ACM
166views Data Mining» more  KDD 2005»
14 years 10 months ago
A general model for clustering binary data
Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This p...
Tao Li
DOCENG
2008
ACM
13 years 12 months ago
Merging changes in XML documents using reliable context fingerprints
Different dialects of XML have emerged as ubiquitous document exchange formats. For effective collaboration based on such documents, the capability to propagate edit operations pe...
Sebastian Rönnau, Christian Pauli, Uwe M. Bor...