Data Mining | Sciweavers


KDD
2001
ACM


The "DGX" distribution for mining massive, skewed data

16 years 6 months ago

Skewed distributions appear very often in practice. Unfortunately, the traditional Zipf distribution often fails to model them well. In this paper, we propose a new probability di...

Zhiqiang Bi, Christos Faloutsos, Flip Korn


KDD
2001
ACM


Mining massively incomplete data sets by conceptual reconstruction

16 years 6 months ago

Download www.cse.ohio-state.edu

Charu C. Aggarwal, Srinivasan Parthasarathy


KDD
2001
ACM


Evaluating the novelty of text-mined rules using lexical knowledge

16 years 6 months ago

Download www.ideal.ece.utexas.edu

Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupu...


KDD
2001
ACM


Data mining case study: modeling the behavior of offenders who commit serious sexual assaults

16 years 6 months ago

Download elvis.slis.indiana.edu

Richard Adderley, Peter B. Musgrove


KDD
2001
ACM


Random projection in dimensionality reduction: applications to image and text data

16 years 6 months ago

Download www.cis.hut.fi

Random projections have recently emerged as a powerful method for dimensionality reduction. Theoretical results indicate that the method preserves distances quite nicely; however,...

Ella Bingham, Heikki Mannila


KDD
2001
ACM


Classification of genes using probabilistic models of microarray expression profiles

16 years 6 months ago

Download noble.gs.washington.edu

Paul Pavlidis, Christopher Tang, William Stafford ...


KDD
2001
ACM


Hierarchical cluster analysis of SAGE data for cancer profiling

16 years 6 months ago

Download www.cs.ualberta.ca

In this paper we present a method for clustering SAGE (Serial Analysis of Gene Expression) data to detect similarities and dissimilarities between different types of cancer on the...

Jörg Sander, Monica C. Sleumer, Raymond T. Ng


KDD
2001
ACM


Learning to recognize brain specific proteins based on low-level features from on-line prediction servers

16 years 6 months ago

Download www.ailab.si

During the last decade, the area of bioinformatics has produced an overwhelming amount of data, with the recently published draft of the human genome being the most prominent exam...

Henrik Boström, Joakim Cöster, Lars Aske...


KDD
2001
ACM


A scalable algorithm for clustering protein sequences

16 years 6 months ago