SDM 2008 | Sciweavers

185

SDM
2008
SIAM

158views Data Mining» more SDM 2008»

15 years 8 months ago

Similarity Measures for Categorical Data: A Comparative Evaluation

Measuring similarity or distance between two entities is a key step for several data mining and knowledge discovery tasks. The notion of similarity for continuous data is relative...

Shyam Boriah, Varun Chandola, Vipin Kumar

claim paper

Read More »

185

click to vote

SDM
2008
SIAM

133views Data Mining» more SDM 2008»

A RELIEF Based Feature Extraction Algorithm

15 years 8 months ago

Download siam.org

RELIEF is considered one of the most successful algorithms for assessing the quality of features due to its simplicity and effectiveness. It has been recently proved that RELIEF i...

Yijun Sun, Dapeng Wu

claim paper

Read More »

171

click to vote

SDM
2008
SIAM

123views Data Mining» more SDM 2008»

Constrained Co-clustering of Gene Expression Data

15 years 8 months ago

Download siam.org

In many applications, the expert interpretation of coclustering is easier than for mono-dimensional clustering. Co-clustering aims at computing a bi-partition that is a collection...

Ruggero G. Pensa, Jean-François Boulicaut

claim paper

Read More »

147

click to vote

SDM
2008
SIAM

140views Data Mining» more SDM 2008»

Large-Scale Many-Class Learning

15 years 8 months ago

Download www.personal.psu.edu

In many multiclass learning scenarios, the number of classes is relatively large (thousands,...), or the space and time efficiency of the learning system can be crucial. We invest...

Omid Madani, Michael Connor

claim paper

Read More »

193

click to vote

SDM
2008
SIAM

139views Data Mining» more SDM 2008»

Simultaneous Unsupervised Learning of Disparate Clusterings

15 years 8 months ago

Download www.cs.utexas.edu

Most clustering algorithms produce a single clustering for a given data set even when the data can be clustered naturally in multiple ways. In this paper, we address the difficult...

Prateek Jain, Raghu Meka, Inderjit S. Dhillon

claim paper

Read More »

190

Voted

SDM
2008
SIAM

139views Data Mining» more SDM 2008»

Semi-Supervised Learning Based on Semiparametric Regularization

15 years 8 months ago

Download www.fortune.binghamton.edu

Semi-supervised learning plays an important role in the recent literature on machine learning and data mining and the developed semisupervised learning techniques have led to many...

Zhen Guo, Zhongfei (Mark) Zhang, Eric P. Xing, Chr...

claim paper

Read More »

198

click to vote

SDM
2008
SIAM

138views Data Mining» more SDM 2008»

Clustering from Constraint Graphs

15 years 8 months ago

Download www.cs.cmu.edu

In constrained clustering it is common to model the pairwise constraints as edges on the graph of observations. Using results from graph theory, we analyze such constraint graphs ...

Ari Freund, Dan Pelleg, Yossi Richter

claim paper

Read More »

191

click to vote

SDM
2008
SIAM

125views Data Mining» more SDM 2008»

Mining and Ranking Generators of Sequential Patterns

15 years 8 months ago

Download www.comp.nus.edu.sg

Sequential pattern mining first proposed by Agrawal and Srikant has received intensive research due to its wide range applicability in many real-life domains. Various improvements...

David Lo, Siau-Cheng Khoo, Jinyan Li

claim paper

Read More »

162

click to vote

SDM
2008
SIAM

197views Data Mining» more SDM 2008»

A general framework for estimating similarity of datasets and decision trees: exploring semantic similarity of decision trees

15 years 8 months ago

Download cui.unige.ch

Decision trees are among the most popular pattern types in data mining due to their intuitive representation. However, little attention has been given on the definition of measure...

Irene Ntoutsi, Alexandros Kalousis, Yannis Theodor...

claim paper

Read More »

198

click to vote

SDM
2008
SIAM

161views Data Mining» more SDM 2008»

Efficient Maximum Margin Clustering via Cutting Plane Algorithm

15 years 8 months ago

Download siam.org

Maximum margin clustering (MMC) is a recently proposed clustering method, which extends the theory of support vector machine to the unsupervised scenario and aims at finding the m...

Bin Zhao, Fei Wang, Changshui Zhang

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers