Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P

15 years 1 months ago

Download www.mcs.anl.gov

— High-end computing (HEC) systems have passed the petaﬂop barrier and continue to move toward the next frontier of exascale computing. As companies and research institutes continue to work toward architecting these enormous systems, it is becoming increasingly clear that these systems will utilize a signiﬁcant amount of shared hardware between processing units, including shared caches, memory management engines, and network infrastructure. While these systems are optimized to use all of the hardware available in a dedicated manner to achieve the best performance, in practice, the shared nature of this hardware makes scheduling applications on it difﬁcult and wasteful. For example, while the IBM Blue Gene/P system has been designed to use a torus network for efﬁcient communication, some of the torus links (especially those connecting different racks) are shared between multiple racks. Thus, a job running on one rack, might preclude another job from running on a second rack in...

Narayan Desai, Darius Buntinas, Daniel Buettner, P

Real-time Traffic

Distributed And Parallel Computing | Enormous Systems | Exascale Computing | Hardware | ICPP 2009 |

claim paper

Post Info
More Details (n/a)

Added	23 May 2010
Updated	23 May 2010
Type	Conference
Year	2009
Where	ICPP
Authors	Narayan Desai, Darius Buntinas, Daniel Buettner, Pavan Balaji, Anthony Chan

Comments (0)

Sciweavers

Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P

Distributed And Parallel Computing | Enormous Systems | Exascale Computing | Hardware | ICPP 2009 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers