GrayWulf: Scalable Software Architecture for Data Intensive Computing

16 years 1 months ago

Download csdl2.computer.org

Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf . These services are intended for deployment on a cluster of commodity servers similar to the well-known Beowulf clusters. We use the Pan-STARRS system currently under development as an example of the architecture and principles in action.

Yogesh Simmhan, Roger S. Barga, Catharine van Inge

Real-time Traffic

Biometrics | Cluster Infrastructure Software | HICSS 2009 | Parallel Application Design | Petabyte Data Sets | System Sciences |

claim paper

» Software Architecture for LargeScale Distributed DataIntensive Systems

» Biocompute towards a collaborative workspace for data intensive bioscience

» GDIA A Scalable Grid Infrastructure for Data Intensive Applications

» Performance impact of proxies in data intensive clientserver applications

» Performance Implications of Architectural and Software Techniques on IOIntensive Applicati...

» A Framework for DataIntensive Computing with Cloud Bursting

» The Virtual Data Grid A New Model and Architecture for DataIntensive Collaboration

» A Service for DataIntensive Computations on Virtual Clusters

Post Info
More Details (n/a)

Added	19 May 2010
Updated	19 May 2010
Type	Conference
Year	2009
Where	HICSS
Authors	Yogesh Simmhan, Roger S. Barga, Catharine van Ingen, María A. Nieto-Santisteban, Laszlo Dobos, Nolan Li, Michael Shipway, Alexander S. Szalay, Sue Werner, Jim Heasley

Comments (0)

Sciweavers

GrayWulf: Scalable Software Architecture for Data Intensive Computing

Biometrics | Cluster Infrastructure Software | HICSS 2009 | Parallel Application Design | Petabyte Data Sets | System Sciences |

Explore & Download

Productivity Tools

Sciweavers