Sciweavers

826 search results - page 35 / 166
» Managing clusters of geographically distributed high-perform...
CLUSTER
2007
IEEE
14 years 3 months ago
Anomaly localization in large-scale clusters
A critical problem in managing large-scale clusters is identifying the location of problems in a system when unusual events occur. As the scale of high performance compu...
Ziming Zheng, Yawei Li, Zhiling Lan
CCGRID
2007
IEEE
14 years 3 months ago
Standardization of an API for Distributed Resource Management Systems
Today’s cluster and grid environments demand the usage of product-specific APIs and tools for developing distributed applications. We give an overview of the Distributed Resour...
Peter Tröger, Hrabri Rajic, Andreas Haas, Pio...
SC
2009
ACM
14 years 3 months ago
Exploring many task computing in scientific workflows
One of the main advantages of using a scientific workflow management system (SWfMS) to orchestrate data flows among scientific activities is to control and register the whole work...
Eduardo S. Ogasawara, Daniel de Oliveira, Fernando...
WSC
2007
13 years 11 months ago
High-performance computing enables simulations to transform education
This paper presents the case that education in the 21st Century can only measure up to national needs if technologies developed in the simulation community, further enhanced by th...
Dan M. Davis, Thomas D. Gottschalk, Laurel K. Davi...
CCGRID
2008
IEEE
13 years 10 months ago
Fault Tolerance in Cluster Federations with O2P-CF
Fault tolerance is one of the key issues for large scale applications executed on high performance computing systems. In a cluster federation, clusters are gathered to provide hug...
Thomas Ropars, Christine Morin