Search Sciweavers | Sciweavers

7387 search results - page 1438 / 1478

» Knowledge-based data mining

171

click to vote

PODS
2010
ACM

215views Database» more PODS 2010»

An optimal algorithm for the distinct elements problem

15 years 12 months ago

Download www.almaden.ibm.com

We give the ﬁrst optimal algorithm for estimating the number of distinct elements in a data stream, closing a long line of theoretical research on this problem begun by Flajolet...

Daniel M. Kane, Jelani Nelson, David P. Woodruff

claim paper

Read More »

233

click to vote

SIGMOD
2010
ACM

324views Database» more SIGMOD 2010»

15 years 11 months ago

Similarity search and locality sensitive hashing using ternary content addressable memories

Download klamath.stanford.edu

Similarity search methods are widely used as kernels in various data mining and machine learning applications including those in computational biology, web search/clustering. Near...

Rajendra Shinde, Ashish Goel, Pankaj Gupta, Debojy...

claim paper

Read More »

197

click to vote

SIGMOD
2010
ACM

277views Database» more SIGMOD 2010»

A comparison of join algorithms for log processing in MaPreduce

15 years 11 months ago

Download pages.cs.wisc.edu

The MapReduce framework is increasingly being used to analyze large volumes of data. One important type of data analysis done with MapReduce is log processing, in which a click-st...

Spyros Blanas, Jignesh M. Patel, Vuk Ercegovac, Ju...

claim paper

Read More »

201

click to vote

SIGMOD
1998
ACM

99views Database» more SIGMOD 1998»

CURE: An Efficient Clustering Algorithm for Large Databases

15 years 11 months ago

Download www.cs.sfu.ca

Clustering, in data mining, is useful for discovering groups and identifying interesting distributions in the underlying data. Traditional clustering algorithms either favor clust...

Sudipto Guha, Rajeev Rastogi, Kyuseok Shim

claim paper

Read More »

189

click to vote

VLDB
1998
ACM

120views Database» more VLDB 1998»

PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning

15 years 11 months ago

Download www.vldb.org

Classification is an important problem in data mining. Given a database of records, each with a class label, a classifier generates a concise and meaningful description for each c...

Rajeev Rastogi, Kyuseok Shim

claim paper

Read More »

« Prev « First page 1438 / 1478 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers