Sciweavers

SDM
2009
SIAM
176views Data Mining» more  SDM 2009»
14 years 8 months ago
Constraint-Based Subspace Clustering.
In high dimensional data, the general performance of traditional clustering algorithms decreases. This is partly because the similarity criterion used by these algorithms becomes ...
Élisa Fromont, Adriana Prado, Céline...
SDM
2009
SIAM
112views Data Mining» more  SDM 2009»
14 years 8 months ago
A Re-evaluation of the Over-Searching Phenomenon in Inductive Rule Learning.
Most commonly used inductive rule learning algorithms employ a hill-climbing search, whereas local pattern discovery algorithms employ exhaustive search. In this paper, we evaluat...
Frederik Janssen, Johannes Fürnkranz
SDM
2009
SIAM
161views Data Mining» more  SDM 2009»
14 years 8 months ago
Polynomial-Delay and Polynomial-Space Algorithms for Mining Closed Sequences, Graphs, and Pictures in Accessible Set Systems.
In this paper, we study efficient closed pattern mining in a general framework of set systems, which are families of subsets ordered by set-inclusion with a certain structure, pro...
Hiroki Arimura, Takeaki Uno
SDM
2009
SIAM
157views Data Mining» more  SDM 2009»
14 years 8 months ago
MUSK: Uniform Sampling of k Maximal Patterns.
Recent research in frequent pattern mining (FPM) has shifted from obtaining the complete set of frequent patterns to generating only a representative (summary) subset of frequent ...
Mohammad Al Hasan, Mohammed Javeed Zaki
SDM
2009
SIAM
114views Data Mining» more  SDM 2009»
14 years 8 months ago
On the Comparison of Relative Clustering Validity Criteria.
Many different relative clustering validity criteria exist that are very useful in practice as quantitative measures for evaluating the quality of data partitions, and new criter...
Lucas Vendramin, Ricardo J. G. B. Campello, Eduard...
SDM
2009
SIAM
179views Data Mining» more  SDM 2009»
14 years 8 months ago
Parallel Large Scale Feature Selection for Logistic Regression.
Daria Sorokina, Jeremy Kubica, Sameer Singh, Scott...
SDM
2009
SIAM
196views Data Mining» more  SDM 2009»
14 years 8 months ago
MultiVis: Content-Based Social Network Exploration through Multi-way Visual Analysis.
With the explosion of social media, scalability becomes a key challenge. There are two main aspects of the problems that arise: 1) data volume: how to manage and analyze huge data...
Ching-Yung Lin, Jimeng Sun, Nan Cao, Shixia Liu, S...
SDM
2009
SIAM
152views Data Mining» more  SDM 2009»
14 years 8 months ago
Multiple Kernel Clustering.
Maximum margin clustering (MMC) has recently attracted considerable interests in both the data mining and machine learning communities. It first projects data samples to a kernel...
Bin Zhao, James T. Kwok, Changshui Zhang
SDM
2009
SIAM
144views Data Mining» more  SDM 2009»
14 years 8 months ago
CORE: Nonparametric Clustering of Large Numeric Databases.
Current clustering techniques are able to identify arbitrarily shaped clusters in the presence of noise, but depend on carefully chosen model parameters. The choice of model param...
Andrej Taliun, Arturas Mazeika, Michael H. Bö...
SDM
2009
SIAM
175views Data Mining» more  SDM 2009»
14 years 8 months ago
Low-Entropy Set Selection.
Most pattern discovery algorithms easily generate very large numbers of patterns, making the results impossible to understand and hard to use. Recently, the problem of instead sel...
Hannes Heikinheimo, Jilles Vreeken, Arno Siebes, H...