Sciweavers

512 search results - page 95 / 103
» A High-Performance Distributed Algorithm for Mining Associat...
KDD
1998
ACM
145 views, Data Mining
14 years 20 days ago
Coincidence Detection: A Fast Method for Discovering Higher-Order Correlations in Multidimensional Data
We present a novel, fast method for association mining in high-dimensional datasets. Our Coincidence Detection method, which combines random sampling and Chernoff-Hoeffding bounds with...
Evan W. Steeg, Derek A. Robinson, Ed Willis
SDM
2008
SIAM
139 views, Data Mining
13 years 10 months ago
Simultaneous Unsupervised Learning of Disparate Clusterings
Most clustering algorithms produce a single clustering for a given data set even when the data can be clustered naturally in multiple ways. In this paper, we address the difficult...
Prateek Jain, Raghu Meka, Inderjit S. Dhillon
VLDB
2004
ACM
163 views, Database
14 years 1 month ago
Compressing Large Boolean Matrices using Reordering Techniques
Large boolean matrices are a basic representational unit in a variety of applications, with some notable examples being interactive visualization systems, mining large graph struc...
David S. Johnson, Shankar Krishnan, Jatin Chhugani...
KDD
2012
ACM
179 views, Data Mining
11 years 11 months ago
Transparent user models for personalization
Personalization is a ubiquitous phenomenon in our daily online experience. While such technology is critical for helping us combat the overload of information we face, in many cas...
Khalid El-Arini, Ulrich Paquet, Ralf Herbrich, Jur...
BMCBI
2008
141 views
13 years 8 months ago
The development of PIPA: an integrated and automated pipeline for genome-wide protein function annotation
Background: Automated protein function prediction methods are needed to keep pace with high-throughput sequencing. With the existence of many programs and databases for inferring ...
Chenggang Yu, Nela Zavaljevski, Valmik Desai, Seth...