Sciweavers

482 search results - page 21 / 97
» A large-scale study of failures in high-performance computin...
Sort
View
133
Voted
GRID
2008
Springer
15 years 4 months ago
Troubleshooting thousands of jobs on production grids using data mining techniques
Large scale production computing grids introduce new challenges in debugging and troubleshooting. A user that submits a workload consisting of tens of thousands of jobs to a grid ...
David A. Cieslak, Nitesh V. Chawla, Douglas Thain
162
Voted
ICPADS
1996
IEEE
15 years 7 months ago
Implementation of MAP: A system for mobile assistant programming
We have de ne a network programming model called Mobile Assistant Programming (MAP) for development and execution of communication applications in large scale networks of heteroge...
Stéphane Perret, Andrzej Duda
143
Voted
EICS
2009
ACM
15 years 10 months ago
Toward user interface virtualization: legacy applications and innovative interaction systems
Single-user, desktop-based computer applications are pervasive in our daily lives and work. The prospect of using these applications with innovative interaction systems, like mult...
Guillaume Besacier, Frédéric Vernier
113
Voted
SIGOPS
2010
144views more  SIGOPS 2010»
15 years 1 months ago
The case for a versatile storage system
Storage systems in emerging large-scale (a.k.a. peta-scale) computing systems often introduce a performance or scalability bottleneck. To deal with these limitations we propose a ...
Samer Al-Kiswany, Abdullah Gharaibeh, Matei Ripean...
INFOCOM
2005
IEEE
15 years 9 months ago
On failure detection algorithms in overlay networks
— One of the key reasons overlay networks are seen as an excellent platform for large scale distributed systems is their resilience in the presence of node failures. This resilie...
Shelley Zhuang, Dennis Geels, Ion Stoica, Randy H....