Sciweavers

1301 search results - page 198 / 261
» Performance Analysis of Parallel Job Scheduling in Distribut...
Sort
View
IPPS
1999
IEEE
14 years 1 months ago
Run-Time Selection of Block Size in Pipelined Parallel Programs
Parallelizing compiler technology has improved in recent years. One area in which compilers have made progress is in handling DOACROSS loops, where crossprocessor data dependencie...
David K. Lowenthal, Michael James
EUROPAR
2003
Springer
14 years 2 months ago
Distributed Application Monitoring for Clustered SMP Architectures
Abstract. Performance analysis for terascale computing requires a combination of new concepts including distribution, on-line processing and automation. As a foundation for tools r...
Karl Fürlinger, Michael Gerndt
IEEEPACT
2000
IEEE
14 years 1 months ago
aSOC: A Scalable, Single-Chip Communications Architecture
As on-chip integration matures, single-chip system designers must not only be concerned with component-level issues such as performance and power, but also with onchip system-leve...
Jian Liang, Sriram Swaminathan, Russell Tessier
ICPP
1998
IEEE
14 years 1 months ago
A memory-layout oriented run-time technique for locality optimization
Exploiting locality at run-time is a complementary approach to a compiler approach for those applications with dynamic memory access patterns. This paper proposes a memory-layout ...
Yong Yan, Xiaodong Zhang, Zhao Zhang
IPPS
2007
IEEE
14 years 3 months ago
United-FS: A Logical File System Providing a Single Image of Multiple Physical File Systems on NFS Server
NFS is considered to be the bottleneck in cluster computing environment because of its limited resources and centralized data management. With the development of hardware, NFS ser...
Huan Chen, Yi Zhao, Jin Xiong, Jie Ma, Ninghui Sun