massive data sets | Sciweavers

59

ICDE
2012
IEEE

216views Database» more ICDE 2012»

Load Balancing in MapReduce Based on Scalable Cardinality Estimates

12 years 4 months ago

—MapReduce has emerged as a popular tool for distributed and scalable processing of massive data sets and is increasingly being used in e-science applications. Unfortunately, the...

Benjamin Gufler, Nikolaus Augsten, Angelika Reiser...

claim paper

Read More »

48

click to vote

ISCA
2011
IEEE

269views Hardware» more ISCA 2011»

Power management of online data-intensive services

13 years 6 months ago

Download davidmeisner.org

Much of the success of the Internet services model can be attributed to the popularity of a class of workloads that we call Online Data-Intensive (OLDI) services. These workloads ...

David Meisner, Christopher M. Sadler, Luiz Andr&ea...

claim paper

Read More »

49

click to vote

JMIV
2011

179views more JMIV 2011»

3-D Data Denoising and Inpainting with the Low-Redundancy Fast Curvelet Transform

13 years 9 months ago

Download jstarck.free.fr

In this paper, we ﬁrst present a new implementation of the 3-D fast curvelet transform, which is nearly 2.5 less redundant than the Curvelab (wrapping-based) implementation as o...

A. Woiselle, Jean-Luc Starck, Jalal Fadili

claim paper

Read More »

41

click to vote

CORR
2010
Springer

70views Education» more CORR 2010»

Computation in Large-Scale Scientific and Internet Data Applications is a Focus of MMDS 2010

13 years 11 months ago

Download www.sigkdd.org

A report is provided for the ACM SIGKDD community about the 2010 Workshop on Algorithms for Modern Massive Data Sets (MMDS 2010), its origin in MMDS 2006 and MMDS 2008, and future...

Michael W. Mahoney

claim paper

Read More »

34

click to vote

TC
2008

109views Information Technology» more TC 2008»

Optimal and Practical Algorithms for Sorting on the PDM

14 years 2 months ago

Download www.cse.iitd.ernet.in

Abstract. The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottleneck that arises in the processing of massive data sets. Sorting has been extensively studied ...

Sanguthevar Rajasekaran, Sandeep Sen

claim paper

Read More »

37

click to vote

CORR
2007
Springer

88views Education» more CORR 2007»

Faster subsequence recognition in compressed strings

14 years 2 months ago

Download www.dcs.warwick.ac.uk

Abstract. Processing compressed strings without decompression is often essential when dealing with massive data sets. We consider local subsequence recognition problems on strings ...

Alexandre Tiskin

claim paper

Read More »

41

click to vote

IVS
2008

138views more IVS 2008»

Extending the attribute explorer to support professional team-sport analysis

14 years 2 months ago

Download www.palgrave-journals.com

Advances in interactive systems and the ability to manage increasing amounts of high-dimensional data provide new opportunities in numerous domains. Information visualization tech...

Pär-Anders Albinsson, Dennis Andersson

claim paper

Read More »

43

click to vote

ICML
2010
IEEE

217views Machine Learning» more ICML 2010»

Budgeted Nonparametric Learning from Data Streams

14 years 3 months ago

Download www.cs.caltech.edu

We consider the problem of extracting informative exemplars from a data stream. Examples of this problem include exemplarbased clustering and nonparametric inference such as Gauss...

Ryan Gomes, Andreas Krause

claim paper

Read More »

32

click to vote

EDBTW
2006
Springer

115views Software Engineering» more EDBTW 2006»

Constructing Optimal Wavelet Synopses

14 years 6 months ago

Download www.dbnet.ece.ntua.gr

The wavelet decomposition is a proven tool for constructing concise synopses of massive data sets and rapid changing data streams, which can be used to obtain fast approximate, wit...

Dimitris Sacharidis

claim paper

Read More »

47

click to vote

SPIRE
2009
Springer

112views Information Technology» more SPIRE 2009»

Sketching Algorithms for Approximating Rank Correlations in Collaborative Filtering Systems

14 years 9 months ago

Download research.microsoft.com

Collaborative ﬁltering (CF) shares information between users to provide each with recommendations. Previous work suggests using sketching techniques to handle massive data sets i...

Yoram Bachrach, Ralf Herbrich, Ely Porat

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers