Sciweavers
471 search results (page 11 of 95) for "MapReduce: Simplified Data Processing on Large Clusters"
SC 2009, ACM
Kepler + Hadoop: a general architecture facilitating data-intensive applications in scientific workflow systems
MapReduce provides a parallel and scalable programming model for data-intensive business and scientific applications. MapReduce and its de facto open-source implementation, Hadoop...
Jianwu Wang, Daniel Crawl, Ilkay Altintas
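The programming model named in the abstract above splits a computation into a map phase that emits key-value pairs and a reduce phase that aggregates them by key. A minimal single-process sketch of that idea (an illustration only, not the Hadoop or Kepler API):

```python
from collections import defaultdict

def map_phase(documents):
    # Map step: emit a (word, 1) pair for every word in every document.
    for doc in documents:
        for word in doc.split():
            yield (word, 1)

def reduce_phase(pairs):
    # Reduce step: group intermediate pairs by key and sum the counts.
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

docs = ["the map step emits pairs", "the reduce step sums pairs"]
print(reduce_phase(map_phase(docs)))
```

In a real MapReduce system the map and reduce calls run in parallel across a cluster, with the framework shuffling intermediate pairs between them; the word-count example above only illustrates the two-phase structure.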
CLOUDCOM 2010, Springer
Efficient Metadata Generation to Enable Interactive Data Discovery over Large-Scale Scientific Data Collections
Discovering the correct dataset efficiently is critical for computations and effective simulations in scientific experiments. In contrast to searching web documents over the Intern...
Sangmi Lee Pallickara, Shrideep Pallickara, Milija...
DASFAA 2009, IEEE
TRUSTER: TRajectory Data Processing on ClUSTERs
With continued advances in the infrastructures behind location-based services, large amounts of time-based location data are accumulating quickly. Distributed processing techni...
Bin Yang 0002, Qiang Ma, Weining Qian, Aoying Zhou
BMCBI 2010
BABAR: an R package to simplify the normalisation of common reference design microarray-based transcriptomic datasets
Background: The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design ...
Mark J. Alston, John Seers, Jay C. D. Hinton, Sach...
KDD 2002, ACM
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman