Sciweavers

420 search results - page 62 / 84
» Scalable Parallel Programming with CUDA
Sort
View
IPPS
1996
IEEE
14 years 18 days ago
A TeraFLOP Supercomputer in 1996: The ASCI TFLOP System
To maintain the integrity of the US nuclear stockpile without detonating nuclear weapons, the DOE needs the results of computer-simulations that overwhelm the world's most po...
Timothy G. Mattson, David Scott, Stephen R. Wheat
HPCN
1994
Springer
14 years 16 days ago
Experiments with HPF Compilation for a Network of Workstations
Abstract. High Performance Fortran (hpf) is a data-parallel Fortran for Distributed Memory Multiprocessors. Hpf provides an interesting programming model but compilers are yet to c...
Fabien Coelho
PPOPP
2003
ACM
14 years 1 months ago
Optimizing data aggregation for cluster-based internet services
Large-scale cluster-based Internet services often host partitioned datasets to provide incremental scalability. The aggregation of results produced from multiple partitions is a f...
Lingkun Chu, Hong Tang, Tao Yang, Kai Shen
PPOPP
2009
ACM
14 years 9 months ago
An efficient transactional memory algorithm for computing minimum spanning forest of sparse graphs
Due to power wall, memory wall, and ILP wall, we are facing the end of ever increasing single-threaded performance. For this reason, multicore and manycore processors are arising ...
Seunghwa Kang, David A. Bader
MIDDLEWARE
2005
Springer
14 years 1 months ago
Matrix: Adaptive Middleware for Distributed Multiplayer Games
Building a distributed middleware infrastructure that provides the low latency required for massively multiplayer games while still maintaining consistency is non-trivial. Previous...
Rajesh Krishna Balan, Maria Ebling, Paul Castro, A...