Search Sciweavers | Sciweavers

463 search results - page 6 / 93

» Accuracy Estimation With Clustered Dataset

334

click to vote

ICDE
2008
IEEE

141views Database» more ICDE 2008»

A General Framework for Fast Co-clustering on Large Datasets Using Matrix Decomposition

16 years 8 months ago

Download www.cs.unc.edu

Abstract-- Simultaneously clustering columns and rows (coclustering) of large data matrix is an important problem with wide applications, such as document mining, microarray analys...

Feng Pan, Xiang Zhang, Wei Wang 0010

claim paper

Read More »

304

click to vote

SIGMOD
2008
ACM

157views Database» more SIGMOD 2008»

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition

16 years 7 months ago

Download compgen.unc.edu

The problem of simultaneously clustering columns and rows (coclustering) arises in important applications, such as text data mining, microarray analysis, and recommendation system...

Feng Pan, Xiang Zhang, Wei Wang 0010

claim paper

Read More »

196

click to vote

KDD
2009
ACM

227views Data Mining» more KDD 2009»

Efficiently learning the accuracy of labeling sources for selective sampling

16 years 7 months ago

Download www.cs.cmu.edu

Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...

Pinar Donmez, Jaime G. Carbonell, Jeff Schneider

claim paper

Read More »

180

click to vote

IJHPCA
2007

88views more IJHPCA 2007»

Scaling Properties of Common Statistical Operators for Gridded Datasets

15 years 7 months ago

Download dust.ess.uci.edu

An accurate cost-model that accounts for dataset size and structure can help optimize geoscience data analysis. We develop and apply a computational model to estimate data analysi...

Charles S. Zender, Harry Mangalam

claim paper

Read More »

184

Voted

ALGORITHMICA
2006

139views more ALGORITHMICA 2006»

CONQUEST: A Coarse-Grained Algorithm for Constructing Summaries of Distributed Discrete Datasets

15 years 7 months ago

Download vorlon.case.edu

Abstract. In this paper we present a coarse-grained parallel algorithm, CONQUEST, for constructing boundederror summaries of high-dimensional binary attributed data in a distribute...

Jie Chi, Mehmet Koyutürk, Ananth Grama

claim paper

Read More »

« Prev « First page 6 / 93 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers