» Reservoir-Based Random Sampling with Replacement from Data S...

SIGMOD
2010
ACM

Category: Database

Continuous sampling for online aggregation over multiple queries

15 years 7 months ago

Download www.comp.nus.edu.sg

In this paper, we propose an online aggregation system called COSMOS (Continuous Sampling for Multiple queries in an Online aggregation System), to process multiple aggregate quer...

Sai Wu, Beng Chin Ooi, Kian-Lee Tan

ICDM
2007
IEEE

Category: Data Mining

On Appropriate Assumptions to Mine Data Streams: Analysis and Practice

15 years 9 months ago

Download www.weifan.info

Recent years have witnessed an increasing number of studies in stream mining, which aim at building an accurate model for continuously arriving data. Somehow most existing work ma...

Jing Gao, Wei Fan, Jiawei Han

SDM
2010
SIAM

Category: Data Mining

MACH: Fast Randomized Tensor Decompositions

15 years 4 months ago

Download www.cs.cmu.edu

Tensors naturally model many real-world processes which generate multi-aspect data. Such processes appear in many different research disciplines, e.g., chemometrics, computer visio...

Charalampos E. Tsourakakis

WWW
2005
ACM

Category: Internet Technology

Sampling search-engine results

16 years 3 months ago

Download www2005.org

We consider the problem of efficiently sampling Web search engine query results. In turn, using a small random sample instead of the full set of results leads to efficient approxi...

Aris Anagnostopoulos, Andrei Z. Broder, David Carm...

BMCBI
2007

Bias in random forest variable importance measures: Illustrations, sources and a solution

15 years 2 months ago

Download www.stat.uni-muenchen.de

Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and relate...

Carolin Strobl, Anne-Laure Boulesteix, Achim Zeile...
