Sciweavers

SDM
2009
SIAM
114views Data Mining» more  SDM 2009»
14 years 8 months ago
On the Comparison of Relative Clustering Validity Criteria.
Many different relative clustering validity criteria exist that are very useful in practice as quantitative measures for evaluating the quality of data partitions, and new criter...
Lucas Vendramin, Ricardo J. G. B. Campello, Eduard...
SDM
2009
SIAM
179views Data Mining» more  SDM 2009»
14 years 8 months ago
Parallel Large Scale Feature Selection for Logistic Regression.
Daria Sorokina, Jeremy Kubica, Sameer Singh, Scott...
SDM
2009
SIAM
196views Data Mining» more  SDM 2009»
14 years 8 months ago
MultiVis: Content-Based Social Network Exploration through Multi-way Visual Analysis.
With the explosion of social media, scalability becomes a key challenge. There are two main aspects of the problems that arise: 1) data volume: how to manage and analyze huge data...
Ching-Yung Lin, Jimeng Sun, Nan Cao, Shixia Liu, S...
SDM
2009
SIAM
152views Data Mining» more  SDM 2009»
14 years 8 months ago
Multiple Kernel Clustering.
Maximum margin clustering (MMC) has recently attracted considerable interests in both the data mining and machine learning communities. It first projects data samples to a kernel...
Bin Zhao, James T. Kwok, Changshui Zhang
SDM
2009
SIAM
144views Data Mining» more  SDM 2009»
14 years 8 months ago
CORE: Nonparametric Clustering of Large Numeric Databases.
Current clustering techniques are able to identify arbitrarily shaped clusters in the presence of noise, but depend on carefully chosen model parameters. The choice of model param...
Andrej Taliun, Arturas Mazeika, Michael H. Bö...
SDM
2009
SIAM
175views Data Mining» more  SDM 2009»
14 years 8 months ago
Low-Entropy Set Selection.
Most pattern discovery algorithms easily generate very large numbers of patterns, making the results impossible to understand and hard to use. Recently, the problem of instead sel...
Hannes Heikinheimo, Jilles Vreeken, Arno Siebes, H...
SDM
2009
SIAM
149views Data Mining» more  SDM 2009»
14 years 8 months ago
Near-optimal Supervised Feature Selection among Frequent Subgraphs.
Graph classification is an increasingly important step in numerous application domains, such as function prediction of molecules and proteins, computerised scene analysis, and an...
Alexander J. Smola, Arthur Gretton, Hans-Peter Kri...
SDM
2009
SIAM
173views Data Mining» more  SDM 2009»
14 years 8 months ago
Discretized Spatio-Temporal Scan Window.
The focus of this paper is the discovery of anomalous spatio-temporal windows. We propose a Discretized SpatioTemporal Scan Window approach to address the question of how we can t...
Aryya Gangopadhyay, Seyed H. Mohammadi, Vandana Pu...
SDM
2009
SIAM
138views Data Mining» more  SDM 2009»
14 years 8 months ago
ShatterPlots: Fast Tools for Mining Large Graphs.
Graphs appear in several settings, like social networks, recommendation systems, computer communication networks, gene/protein biological networks, among others. A deep, recurring...
Ana Paula Appel, Andrew Tomkins, Christos Faloutso...
SDM
2009
SIAM
123views Data Mining» more  SDM 2009»
14 years 8 months ago
Measuring Discrimination in Socially-Sensitive Decision Records.
Discrimination in social sense (e.g., against minorities and disadvantaged groups) is the subject of many laws worldwide, and it has been extensively studied in the social and eco...
Dino Pedreschi, Franco Turini, Salvatore Ruggieri