Search Sciweavers | Sciweavers

5093 search results - page 860 / 1019

» How Real are Real Numbers

182

KDD
2004
ACM

134views Data Mining» more KDD 2004»

Exploiting a support-based upper bound of Pearson's correlation coefficient for efficiently identifying strongly correlated pair

16 years 6 months ago

Download cimic.rutgers.edu

Given a user-specified minimum correlation threshold and a market basket database with N items and T transactions, an all-strong-pairs correlation query finds all item pairs with...

Hui Xiong, Shashi Shekhar, Pang-Ning Tan, Vipin Ku...

claim paper

Read More »

188

click to vote

KDD
2001
ACM

253views Data Mining» more KDD 2001»

GESS: a scalable similarity-join algorithm for mining large data sets in high dimensional spaces

16 years 6 months ago

Download elvis.slis.indiana.edu

The similarity join is an important operation for mining high-dimensional feature spaces. Given two data sets, the similarity join computes all tuples (x, y) that are within a dis...

Jens-Peter Dittrich, Bernhard Seeger

claim paper

Read More »

152

click to vote

STOC
2002
ACM

119views Algorithms» more STOC 2002»

Space-efficient approximate Voronoi diagrams

16 years 6 months ago

Download www.cs.ust.hk

Given a set S of n points in IRd , a (t, )-approximate Voronoi diagram (AVD) is a partition of space into constant complexity cells, where each cell c is associated with t represe...

Sunil Arya, Theocharis Malamatos, David M. Mount

claim paper

Read More »

248

click to vote

SIGMOD
2009
ACM

125views Database» more SIGMOD 2009»

Top-k queries on uncertain data: on score distribution and typical answers

16 years 6 months ago

Download db.csail.mit.edu

Uncertain data arises in a number of domains, including data integration and sensor networks. Top-k queries that rank results according to some user-defined score are an important...

Tingjian Ge, Stanley B. Zdonik, Samuel Madden

claim paper

Read More »

273

click to vote

SIGMOD
2008
ACM

157views Database» more SIGMOD 2008»

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition

16 years 6 months ago

Download compgen.unc.edu

The problem of simultaneously clustering columns and rows (coclustering) arises in important applications, such as text data mining, microarray analysis, and recommendation system...

Feng Pan, Xiang Zhang, Wei Wang 0010

claim paper

Read More »

« Prev « First page 860 / 1019 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers