Search Sciweavers | Sciweavers

471 search results - page 14 / 95

» MapReduce: Simplified Data Processing on Large Clusters

185

Voted

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 1 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

266

Voted

WWW
2009
ACM

449views Internet Technology» more WWW 2009»

Smart Miner: a new framework for mining large scale web usage data

16 years 7 months ago

Download www2009.org

In this paper, we propose a novel framework called SmartMiner for web usage mining problem which uses link information for producing accurate user sessions and frequent navigation...

Murat Ali Bayir, Ismail Hakki Toroslu, Ahmet Cosar...

claim paper

Read More »

224

click to vote

SIGMOD
2012
ACM

226views Database» more SIGMOD 2012»

SkewTune: mitigating skew in mapreduce applications

13 years 9 months ago

Download nuage.cs.washington.edu

We present an automatic skew mitigation approach for userdeﬁned MapReduce programs and present SkewTune, a system that implements this approach as a drop-in replacement for an e...

YongChul Kwon, Magdalena Balazinska, Bill Howe, Je...

claim paper

Read More »

183

click to vote

BTW
2007
Springer

140views Database» more BTW 2007»

SmurfPDMS: A Platform for Query Processing in Large-Scale PDMS

16 years 27 days ago

Download www.btw2007.de

: As Peer Data Management Systems (PDMS) are a focus of current research, there are lots of approaches like query processing or routing issues that have to be evaluated. Since ther...

Katja Hose, Christian Lemke, Jana Quasebarth, Kai-...

claim paper

Read More »

221

click to vote

KDD
2004
ACM

624views Data Mining» more KDD 2004»

Programming the K-means clustering algorithm in SQL

16 years 2 days ago

Download www.cs.uiuc.edu

Using SQL has not been considered an eﬃcient and feasible way to implement data mining algorithms. Although this is true for many data mining, machine learning and statistical a...

Carlos Ordonez

claim paper

Read More »

« Prev « First page 14 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers