KDD 2002 | Sciweavers

184

KDD
2002
ACM

160views Data Mining» more KDD 2002»

Scaling multi-class support vector machines using inter-class confusion

16 years 7 months ago

Support vector machines (SVMs) excel at two-class discriminative learning problems. They often outperform generative classifiers, especially those that use inaccurate generative m...

Shantanu Godbole, Sunita Sarawagi, Soumen Chakraba...

claim paper

Read More »

165

click to vote

KDD
2002
ACM

128views Data Mining» more KDD 2002»

Privacy preserving mining of association rules

16 years 7 months ago

Download www.cs.cornell.edu

We present a framework for mining association rules from transactions consisting of categorical items where the data has been randomized to preserve privacy of individual transact...

Alexandre V. Evfimievski, Ramakrishnan Srikant, Ra...

claim paper

Read More »

161

click to vote

KDD
2002
ACM

170views Data Mining» more KDD 2002»

Web site mining: a new way to spot competitors, customers and suppliers in the world wide web

16 years 7 months ago

Download www.cs.sfu.ca

When automatically extracting information from the world wide web, most established methods focus on spotting single HTMLdocuments. However, the problem of spotting complete web s...

Martin Ester, Hans-Peter Kriegel, Matthias Schuber...

claim paper

Read More »

196

click to vote

KDD
2002
ACM

112views Data Mining» more KDD 2002»

From run-time behavior to usage scenarios: an interaction-pattern mining approach

16 years 7 months ago

Download www.lans.ece.utexas.edu

A key challenge facing IT organizations today is their evolution towards adopting e-business practices that gives rise to the need for reengineering their underlying software syst...

Mohammad El-Ramly, Eleni Stroulia, Paul G. Sorenso...

claim paper

Read More »

169

click to vote

KDD
2002
ACM

155views Data Mining» more KDD 2002»

SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets

16 years 7 months ago

Download elvis.slis.indiana.edu

We propose a new clustering algorithm, called SyMP, which is based on synchronization of pulse-coupled oscillators. SyMP represents each data point by an Integrate-and-Fire oscill...

Hichem Frigui

claim paper

Read More »

180

click to vote

KDD
2002
ACM

118views Data Mining» more KDD 2002»

SECRET: a scalable linear regression tree algorithm

16 years 7 months ago

Download www.cs.cornell.edu

Recently there has been an increasing interest in developing regression models for large datasets that are both accurate and easy to interpret. Regressors that have these properti...

Alin Dobra, Johannes Gehrke

claim paper

Read More »

175

Voted

KDD
2002
ACM

170views Data Mining» more KDD 2002»

Enhanced word clustering for hierarchical text classification

16 years 7 months ago

Download www.cs.utexas.edu

In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering&...

Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...

claim paper

Read More »

172

click to vote

KDD
2002
ACM

125views Data Mining» more KDD 2002»

Pattern discovery in sequences under a Markov assumption

16 years 7 months ago

Download www.ics.uci.edu

In this paper we investigate the general problem of discovering recurrent patterns that are embedded in categorical sequences. An important real-world problem of this nature is mo...

Darya Chudova, Padhraic Smyth

claim paper

Read More »

100

click to vote

KDD
2002
ACM

85views Data Mining» more KDD 2002»

DualMiner: a dual-pruning algorithm for itemsets with constraints

16 years 7 months ago

Download www.cs.cornell.edu

Cristian Bucila, Johannes Gehrke, Daniel Kifer, Wa...

claim paper

Read More »

169

Voted

KDD
2002
ACM

138views Data Mining» more KDD 2002»

Learning to match and cluster large high-dimensional data sets for data integration

16 years 7 months ago

Download www.cs.cmu.edu

Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...

William W. Cohen, Jacob Richman

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers