Sciweavers

5659 search results - page 1016 / 1132
» Comparing Clusterings in Space
KDD
2006
ACM
164 views, Data Mining
16 years 4 months ago
Assessing data mining results via swap randomization
The problem of assessing the significance of data mining results on high-dimensional 0–1 data sets has been studied extensively in the literature. For problems such as mining freq...
Aristides Gionis, Heikki Mannila, Panayiotis Tsapa...
KDD
2005
ACM
161 views, Data Mining
16 years 4 months ago
Combining email models for false positive reduction
Machine learning and data mining can be effectively used to model, classify and discover interesting information for a wide variety of data including email. The Email Mining Toolk...
Shlomo Hershkop, Salvatore J. Stolfo
RECOMB
2007
Springer
16 years 4 months ago
An Efficient and Accurate Graph-Based Approach to Detect Population Substructure
Currently, large-scale projects are underway to perform whole genome disease association studies. Such studies involve the genotyping of hundreds of thousands of SNP markers. One o...
Srinath Sridhar, Satish Rao, Eran Halperin
STOC
2004
ACM
145 views, Algorithms
16 years 4 months ago
Using mixture models for collaborative filtering
A collaborative filtering system at an e-commerce site or similar service uses data about aggregate user behavior to make recommendations tailored to specific user interests. We d...
Jon M. Kleinberg, Mark Sandler
SIGMOD
2009
ACM
136 views, Database
16 years 4 months ago
A comparison of approaches to large-scale data analysis
There is currently considerable enthusiasm around the MapReduce (MR) paradigm for large-scale data analysis [17]. Although the basic control flow of this framework has existed in ...
Andrew Pavlo, Erik Paulson, Alexander Rasin, Danie...