Sciweavers

106 search results - page 7 / 22
» kdd 2002
Sort
View
KDD
2002
ACM
122views Data Mining» more  KDD 2002»
14 years 10 months ago
Hierarchical model-based clustering of large datasets through fractionation and refractionation
The goal of clustering is to identify distinct groups in a dataset. Compared to non-parametric clustering methods like complete linkage, hierarchical model-based clustering has th...
Jeremy Tantrum, Alejandro Murua, Werner Stuetzle
KDD
2002
ACM
138views Data Mining» more  KDD 2002»
14 years 10 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
KDD
2002
ACM
214views Data Mining» more  KDD 2002»
14 years 10 months ago
Privacy preserving association rule mining in vertically partitioned data
Privacy considerations often constrain data mining projects. This paper addresses the problem of association rule mining where transactions are distributed across sources. Each si...
Jaideep Vaidya, Chris Clifton
KDD
2002
ACM
187views Data Mining» more  KDD 2002»
14 years 10 months ago
Transforming classifier scores into accurate multiclass probability estimates
Class membership probability estimates are important for many applications of data mining in which classification outputs are combined with other sources of information for decisi...
Bianca Zadrozny, Charles Elkan
KDD
2002
ACM
189views Data Mining» more  KDD 2002»
14 years 10 months ago
Sequential PAttern mining using a bitmap representation
We introduce a new algorithm for mining sequential patterns. Our algorithm is especially efficient when the sequential patterns in the database are very long. We introduce a novel...
Jay Ayres, Jason Flannick, Johannes Gehrke, Tomi Y...