Scheduling DAGs with communication times is the theoretical basis for achieving efficient parallelism on distributed memory systems. We generalize Graham's task-level in a ma...
Generative parallel design patterns is a proven technique to improve the productivity of parallel program development. However many of the generative design-pattern systems are de...
High-end computing is universally recognized to be a strategic tool for leadership in science and technology. A significant portion of high-end computing is conducted on clusters...
In this paper we present the internal representation and optimizations used by the CASH compiler for improving the memory parallelism of pointer-based programs. CASH uses an SSA-b...
Full-chip thermal monitoring is an important and challenging issue in today’s microprocessor design. In this paper, we propose a new information-theoretic framework to quantitat...
Huapeng Zhou, Xin Li, Chen-Yong Cher, Eren Kursun,...