Search Sciweavers | Sciweavers

7581 search results - page 1476 / 1517

» Incompleteness in Data Mining

186

click to vote

PODS
2010
ACM

215views Database» more PODS 2010»

An optimal algorithm for the distinct elements problem

16 years 17 days ago

Download www.almaden.ibm.com

We give the ﬁrst optimal algorithm for estimating the number of distinct elements in a data stream, closing a long line of theoretical research on this problem begun by Flajolet...

Daniel M. Kane, Jelani Nelson, David P. Woodruff

claim paper

Read More »

258

click to vote

SIGMOD
2010
ACM

324views Database» more SIGMOD 2010»

16 years 9 days ago

Similarity search and locality sensitive hashing using ternary content addressable memories

Download klamath.stanford.edu

Similarity search methods are widely used as kernels in various data mining and machine learning applications including those in computational biology, web search/clustering. Near...

Rajendra Shinde, Ashish Goel, Pankaj Gupta, Debojy...

claim paper

Read More »

216

click to vote

SIGMOD
2010
ACM

277views Database» more SIGMOD 2010»

A comparison of join algorithms for log processing in MaPreduce

16 years 9 days ago

Download pages.cs.wisc.edu

The MapReduce framework is increasingly being used to analyze large volumes of data. One important type of data analysis done with MapReduce is log processing, in which a click-st...

Spyros Blanas, Jignesh M. Patel, Vuk Ercegovac, Ju...

claim paper

Read More »

213

click to vote

SIGMOD
1998
ACM

99views Database» more SIGMOD 1998»

CURE: An Efficient Clustering Algorithm for Large Databases

15 years 11 months ago

Download www.cs.sfu.ca

Clustering, in data mining, is useful for discovering groups and identifying interesting distributions in the underlying data. Traditional clustering algorithms either favor clust...

Sudipto Guha, Rajeev Rastogi, Kyuseok Shim

claim paper

Read More »

207

click to vote

VLDB
1998
ACM

120views Database» more VLDB 1998»

PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning

15 years 11 months ago

Download www.vldb.org

Classification is an important problem in data mining. Given a database of records, each with a class label, a classifier generates a concise and meaningful description for each c...

Rajeev Rastogi, Kyuseok Shim

claim paper

Read More »

« Prev « First page 1476 / 1517 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers