Sciweavers

2716 search results - page 59 / 544
» Integrating Performance Monitoring and Communication in Para...
HPDC
2010
IEEE
15 years 4 months ago
Scalability of communicators and groups in MPI
As the number of cores inside compute clusters continues to grow, the scalability of MPI (Message Passing Interface) is important to ensure that programs can continue to execute o...
Humaira Kamal, Seyed M. Mirtaheri, Alan Wagner
ISCAS
2007
IEEE
15 years 10 months ago
A 3D Integrated Feature-Extracting Image Sensor
In this paper we present a feature-extracting image sensor targeted to wireless image sensor networks. The image sensor was designed and fabricated on a 3D integrated 0...
Zhengming Fu, Eugenio Culurciello
APCSAC
2006
IEEE
15 years 9 months ago
A High Performance Simulator System for a Multiprocessor System Based on a Multi-way Cluster
In the ubiquitous era, it is necessary to research the architectures of multiprocessor systems with high performance and low power consumption. A simulator developed in high level l...
Arata Shinozaki, Masatoshi Shima, Minyi Guo, Mitsu...
PPOPP
2005
ACM
15 years 9 months ago
Performance modeling and optimization of parallel out-of-core tensor contractions
The Tensor Contraction Engine (TCE) is a domain-specific compiler for implementing complex tensor contraction expressions arising in quantum chemistry applications modeling elect...
Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J...
HPCA
2002
IEEE
16 years 4 months ago
Evaluation of a Multithreaded Architecture for Cellular Computing
Cyclops is a new architecture for high performance parallel computers being developed at the IBM T. J. Watson Research Center. The basic cell of this architecture is a single-chip...
Calin Cascaval, José G. Castaños, Lu...