Sciweavers

SC
2015
ACM
8 years 3 months ago
Student cluster competition: a multi-disciplinary undergraduate HPC educational tool
National labs, academic institutions and industry have a strong need for scientists and staff that understand high performance computing (HPC) and the complex interconnections ac...
Stephen Lien Harrell, Hai Ah Nam, Verónica ...
SC
2015
ACM
8 years 3 months ago
Infrastructure for In Situ System Monitoring and Application Data Analysis
We present an architecture for high-performance computers that integrates in situ analysis of hardware and system monitoring data with application-specific data to reduce applica...
Jim M. Brandt, Karen D. Devine, Ann C. Gentile
SC
2015
ACM
8 years 3 months ago
HPC I/O trace extrapolation
Today’s rapid development of supercomputers has caused I/O performance to become a major performance bottleneck for many scientific applications. Trace analysis tools have thus...
Xiaoqing Luo, Frank Mueller, Philip H. Carns, John...
SC
2015
ACM
8 years 3 months ago
Mantle: a programmable metadata load balancer for the ceph file system
Migrating resources is a useful tool for balancing load in a distributed system, but it is difficult to determine when to move resources, where to move resources, and how much of ...
Michael A. Sevilla, Noah Watkins, Carlos Maltzahn,...
SC
2015
ACM
8 years 3 months ago
PL2AP: fast parallel cosine similarity search
Solving the AllPairs similarity search problem entails finding all pairs of vectors in a high dimensional sparse dataset that have a similarity value higher than a given threshol...
David C. Anastasiu, George Karypis
SC
2015
ACM
8 years 3 months ago
Fault tolerant MapReduce-MPI for HPC clusters
Building MapReduce applications using the Message-Passing Interface (MPI) enables us to exploit the performance of large HPC clusters for big data analytics. However, due to the l...
SC
2015
ACM
8 years 3 months ago
Lessons Learned from Building In Situ Coupling Frameworks
Over the past few years, the increasing amounts of data produced by large-scale simulations have motivated a shift from traditional offline data analysis to in situ analysis and v...
Matthieu Dorier, Matthieu Dreher, Tom Peterka, Jus...
SC
2015
ACM
8 years 3 months ago
Performance and productivity of parallel python programming: a study with a CFD test case
The programming language Python is widely used to create rapidly compact software. However, compared to low-level programming languages like C or Fortran low performance is preven...
Achim Basermann, Melven Röhrig-Zöllner, ...
SC
2015
ACM
8 years 3 months ago
Comparative I/O workload characterization of two leadership class storage clusters
The Oak Ridge Leadership Computing Facility (OLCF) is a leader in large-scale parallel file system development, design, deployment and continuous operation. For the last decade, ...
Raghul Gunasekaran, Sarp Oral, Jason Hill, Ross Mi...
SC
2015
ACM
8 years 3 months ago
Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG
The last decade has seen power consumption move from an afterthought to the foremost design constraint of new supercomputers. Measuring the power of a supercomputer can be a daunt...